Phi-4 polypeptides and methods for their use

ABSTRACT

Compositions and methods for controlling pests are provided. The methods involve transforming organisms with a nucleic acid sequence encoding an insecticidal protein. In particular, the nucleic acid sequences are useful for preparing plants and microorganisms that possess insecticidal activity. Thus, transformed bacteria, plants, plant cells, plant tissues and seeds are provided. Compositions are insecticidal nucleic acids and proteins of bacterial species. The sequences find use in the construction of expression vectors for subsequent transformation into organisms of interest, as probes for the isolation of other homologous (or partially homologous) genes. The insecticidal proteins find use in controlling, inhibiting growth or killing lepidopteran, coleopteran, dipteran, fungal, hemipteran, and nematode pest populations and for producing compositions with insecticidal activity.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. Non Provisional application Ser. No. 13/839,702 filed Mar. 15, 2013 the contents of which are herein incorporated by reference in their entirety.

REFERENCE TO SEQUENCE LISTING SUBMITTED ELECTRONICALLY

The official copy of the sequence listing is submitted electronically via EFS-Web as an ASCII formatted sequence listing with a file named “5192USCNT_Sequence_Listing” created on Jun. 8, 2016, and having a size of 3632 kilobytes and is filed concurrently with the specification. The sequence listing contained in this ASCII formatted document is part of the specification and is herein incorporated by reference in its entirety.

LENGTHY TABLES

The patent application contains a lengthy table section as an ASCII formatted document, which is part of the specification and is herein incorporated by reference in its entirety. A copy of the tables are available in electronic form from the USPTO web site (http://seqdata.uspto.gov/). An electronic copy of the table will also be available from the USPTO upon request and payment of the fee set forth in 37 CFR 1.19(b)(3)

FIELD OF THE INVENTION

This disclosure relates to the field of molecular biology. Provided are novel genes that encode pesticidal proteins. These pesticidal proteins and the nucleic acid sequences encoding them are useful in preparing pesticidal formulations and in the production of transgenic pest-resistant plants.

BACKGROUND OF THE INVENTION

Biological control of insect pests of agricultural significance using a microbial agent, such as fungi, bacteria, or another species of insect affords an environmentally friendly and commercially attractive alternative to synthetic chemical pesticides. Generally speaking, the use of biopesticides presents a lower risk of pollution and environmental hazards, and biopesticides provide greater target specificity than is characteristic of traditional broad-spectrum chemical insecticides. In addition, biopesticides often cost less to produce and thus improve economic yield for a wide variety of crops.

Certain species of microorganisms of the genus Bacillus are known to possess pesticidal activity against a range of insect pests including Lepidoptera, Diptera, Coleoptera, Hemiptera and others. Bacillus thuringiensis (Bt) and Bacillus popilliae are among the most successful biocontrol agents discovered to date. Insect pathogenicity has also been attributed to strains of B. larvae, B. lentimorbus, B. sphaericus and B. cereus. Microbial insecticides, particularly those obtained from Bacillus strains, have played an important role in agriculture as alternatives to chemical pest control.

Crop plants have been developed with enhanced insect resistance by genetically engineering crop plants to produce pesticidal proteins from Bacillus. For example, corn and cotton plants have been genetically engineered to produce pesticidal proteins isolated from strains of Bt. These genetically engineered crops are now widely used in agriculture and have provided the farmer with an environmentally friendly alternative to traditional insect-control methods. While they have proven to be very successful commercially, these genetically engineered, insect-resistant crop plants provide resistance to only a narrow range of the economically important insect pests. In some cases, insects can develop resistance to different insecticidal compounds, which raises the need to identify alternative biological control agents for pest control.

Accordingly, there remains a need for new pesticidal proteins with different ranges of insecticidal activity against insect pests, e.g., insecticidal proteins which are active against a variety of insects in the order Lepidoptera and the order Hemiptera including but not limited to species belonging to the family Pentatomidae, the family Plataspidae and the family Cydnidae. In addition, there remains a need for biopesticides having activity against a variety of insect pests that have developed resistance to existing pesticides.

SUMMARY OF THE INVENTION

Compositions and methods for conferring pesticidal activity to bacteria, plants, plant cells, tissues and seeds are provided. Compositions include nucleic acid molecules encoding sequences for pesticidal and insecticidal polypeptides, vectors comprising those nucleic acid molecules, and host cells comprising the vectors. Compositions also include the pesticidal polypeptide sequences and antibodies to those polypeptides. The nucleic acid sequences can be used in DNA constructs or expression cassettes for transformation and expression in organisms, including microorganisms and plants. The nucleotide or amino acid sequences may be synthetic sequences that have been designed for expression in an organism including, but not limited to, a microorganism or a plant. Compositions also comprise transformed bacteria, plants, plant cells, tissues and seeds.

In particular, isolated or recombinant nucleic acid molecules are provided encoding PHI-4 polypeptides including amino acid substitutions, amino acid deletions, amino acid insertions, and fragments thereof, and combinations thereof. Additionally, amino acid sequences corresponding to the PHI-4 polypeptides are encompassed. Nucleic acid sequences that are complementary to a nucleic acid sequence of the embodiments, or that hybridize to a sequence of the embodiments are also encompassed.

Methods are provided for producing the polypeptides and for using those polypeptides for controlling, inhibiting growth or killing a Lepidopteran, Coleopteran, nematode, fungi, Hemipteran and/or Dipteran pests. The transgenic plants of the embodiments express one or more of the pesticidal sequences disclosed herein. In various embodiments, the transgenic plant further comprises one or more additional genes for insect resistance, for example, one or more additional genes for controlling coleopteran, lepidopteran, hemipteran or nematode pests. It will be understood by one of skill in the art that the transgenic plant may comprise any gene imparting an agronomic trait of interest.

Methods for detecting the nucleic acids and polypeptides of the embodiments in a sample are also included. A kit for detecting the presence of a PHI-4 polypeptide or detecting the presence of a nucleotide sequence encoding a PHI-4 polypeptide in a sample is provided. The kit is provided along with all reagents and control samples necessary for carrying out a method for detecting the intended agent, as well as instructions for use.

The compositions and methods of the embodiments are useful for the production of organisms with enhanced pest resistance or tolerance. These organisms and compositions comprising the organisms are desirable for agricultural purposes. The compositions of the embodiments are also useful for generating altered or improved proteins that have pesticidal activity, or for detecting the presence of PHI-4 polypeptides or nucleic acids in products or organisms.

The following embodiments are encompassed by the present disclosure.

Embodiment 1 is a PHI-4 polypeptide having improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 2 is the PHI-4 polypeptide of embodiment 1, wherein the insecticidal activity is increased about 1.5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 3 is the PHI-4 polypeptide of embodiment 1, wherein the insecticidal activity is increased about 2 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 4 is the PHI-4 polypeptide of embodiment 1, wherein the insecticidal activity is increased about 2.5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 5 is the PHI-4 polypeptide of embodiment 1, wherein the insecticidal activity is increased about 3 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 6 is the PHI-4 polypeptide of embodiment 1, wherein the insecticidal activity is increased about 5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 7 is the PHI-4 polypeptide of any one of embodiments 1-6, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is against Western Corn Root Worm (WCRW) larvae.

Embodiment 8 is the PHI-4 polypeptide of any one of embodiments 1-7, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as a Mean FAE Index.

Embodiment 9 is the PHI-4 polypeptide of any one of embodiments 1-7, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as an EC50 value.

Embodiment 10 is the PHI-4 polypeptide of any one of embodiments 1-7, wherein the improved activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as a Mean Deviation Score.

Embodiment 11 is the PHI-4 polypeptide of any one of embodiments 1-10, wherein the PHI-4 polypeptide comprises one or more amino acid substitutions compared to the native amino acid at position 40, 42, 43, 46, 52, 97, 98, 99, 145, 150, 151, 153, 163, 171, 172, 182, 196, 206, 210, 216, 220, 278, 283, 289, 293, 328, 333, 334, 336, 338, 339, 342, 346, 354, 355, 370, 389, 393, 396, 401, 402, 403, 410, 412, 416, 417, 426, 442, 447, 452, 454, 455, 457, 461, 462, 500, 509, 520 or 527 of SEQ ID NO: 35.

Embodiment 12 is the PHI-4 polypeptide of embodiment 11, further comprising one or more amino acid substitutions at position 86, 359, 464, 465, 466, 467, 468, 499 or 517.

Embodiment 13 is the PHI-4 polypeptide of embodiment 11 or 12, wherein the amino acid at position 86 is Glu or Thr; the amino acid at position 359 is Gly or Ala; the amino acid at position 464 is Arg, Ala, Lys, Asp or Asn; the amino acid at position 465 is Lys or Met, the amino acid at position 467 is Val, Ala, Leu or Thr; the amino acid at position 468 is Ser or Leu; the amino acid at position 499 is Glu or Ala, or the amino acid at position 517 is Glu or Arg.

Embodiment 14 is the PHI-4 polypeptide of embodiment 11, 12 or 13, wherein the amino acid at position 40 is Leu or Ile; the amino acid at position 42 is Asp or Asn; the amino acid at position 43 is Phe or Glu; the amino acid at position 46 is Glu or Asn; the amino acid at position 52 is Ile or Val; the amino acid at position 97 is Arg, Asp, Glu or Asn; the amino acid at position 98 is Tyr or Phe; the amino acid at position 99 is Lys or Leu; the amino acid at position 145 is Leu or Val; the amino acid at position 150 is Arg or Gln; the amino acid at position 151 is Asp or Ser; the amino acid at position 153 is Leu or Ile; the amino acid at position 163 is Leu or Val; the amino acid at position 171 is Tyr or Phe; the amino acid at position 172 is Ile or Leu; the amino acid at position 182 is Asp or Gln; the amino acid at position 196 is Gln or Asn; the amino acid at position 206 is Tyr or Phe; the amino acid at position 210 is Val or Ile; the amino acid at position 216 is Glu or Gln; the amino acid at position 220 is Glu, Gln, His or Asp; the amino acid at position 278 is Glu or Asn; the amino acid at position 283 is Ile or Val; the amino acid at position 289 is Lys, Gln or Leu; the amino acid at position 293 is Arg, Gln or Glu; the amino acid at position 328 is Lys or Glu; the amino acid at position 333 is Ser, Lys or Val; the amino acid at position 334 is Gly, Lys or Arg; the amino acid at position 336 is Gly or Ala; the amino acid at position 338 is Ser or Val; the amino acid at position 339 is Glu, Asn or Gln; the amino acid at position 342 is Ala or Ser; the amino acid at position 346 is Pro or Ala; the amino acid at position 354 is Met or Leu; the amino acid at position 355 is Val or Ile; the amino acid at position 370 is His or Arg; the amino acid at position 389 is Trp or Leu; the amino acid at position 393 is Trp or Leu; the amino acid at position 396 is Ala, Leu, Lys, Thr or Gly; the amino acid at position 401 is Ser, His, Gly, Lys or Pro; the amino acid at position 402 is Lys, His, Gly or Trp; the amino acid at position 403 is Asp or Tyr; the amino acid at position 410 is Ile or Val; the amino acid at position 412 is Pro or Ala; the amino acid at position 416 is Arg or Glu; the amino acid at position 417 is Ala or Ser; the amino acid at position 426 is Thr or Ser; the amino acid at position 442 is Gln or Glu; the amino acid at position 447 is Asp or Lys; the amino acid at position 452 is Gln or Lys; the amino acid at position 454 is Arg or Gln; the amino acid at position 455 is Val or Ile; the amino acid at position 457 is Trp or Asn; the amino acid at position 461 is Thr or Ser; the amino acid at position 462 is Gly or Ala; the amino acid at position 500 is Arg or Gln; the amino acid at position 509 is Lys or Gln; the amino acid at position 520 is Lys, Glu or Gln; and the amino acid at position 527 is Gln or Lys.

Embodiment 15 is the PHI-4 polypeptide of any one of embodiments 1-11 and 14, having 1 to 54 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 16 is the PHI-4 polypeptide of any one of embodiments 1-11 and 14, having 1 to 27 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 17 is the PHI-4 polypeptide of any one of embodiments 1-11 and 14, having 1 to 20 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 18 is the PHI-4 polypeptide of any one of embodiments 1-11 and 14, having 1 to 15 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 19 is the PHI-4 polypeptide of any one of embodiments 12 or 13, comprising 2 to 54 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 20 is the PHI-4 polypeptide of any one of embodiments 12 or 13, comprising 2 to 27 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 21 is the PHI-4 polypeptide of any one of embodiments 12 or 13, comprising 2 to 20 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 22 is the PHI-4 polypeptide of any one of embodiments 12 or 13, comprising 2 to 15 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 23 is the PHI-4 polypeptide of any one of embodiments 1-22, wherein the PHI-4 polypeptide has at least 80% identity to SEQ ID NO: 35.

Embodiment 24 is the PHI-4 polypeptide of any one of embodiments 1-22, wherein the PHI-4 polypeptide has at least 90% identity to SEQ ID NO: 35.

Embodiment 25 is the PHI-4 polypeptide of any one of embodiments 1-22, wherein the PHI-4 polypeptide has at least 95% identity to SEQ ID NO: 35.

Embodiment 26 is the PHI-4 polypeptide of any one of embodiments 1-22, wherein the PHI-4 polypeptide has at least 97% identity to SEQ ID NO: 35.

Embodiment 27 is a PHI-4 polypeptide, comprising at least one amino acid substitution at a residue relative to SEQ ID NO: 35 in structural domain selected from:

-   -   a hydrophilic residue;     -   a residue in a membrane insertion initiation loop;     -   a residue in a receptor binding loop; and     -   a residue in a protease sensitive region,

wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35.

Embodiment 28 is the PHI-4 polypeptide of embodiment 27, wherein the hydrophilic residues are Asp, Glu, Lys, Arg, His, Ser, Thr, Tyr, Trp, Asn, Gln, and Cys.

Embodiment 29 is the PHI-4 polypeptide of embodiment 27 or 28, wherein the membrane insertion loops are between about amino acid at position 92 (Val) and 101 (Ala) and at position 211 (Gly) and 220 (Glu) relative to SEQ ID NO: 35.

Embodiment 30 is the PHI-4 polypeptide of embodiment 29, wherein the membrane insertion initiation loop residue is selected from position 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220 of SEQ ID NO: 35.

Embodiment 31 is the PHI-4 polypeptide of any one of embodiments 27-30, wherein the receptor binding loops are between about amino acid 332 (Asp) and 340 (Asp), 395 (Asp) and 403 (Asp), 458 (Asp) and 466 (Asp) relative to SEQ ID NO: 35.

Embodiment 32 is the PHI-4 polypeptide of embodiment 31, wherein the receptor binding loop residue is selected from positions 332, 333, 334, 335, 336, 337, 338, 339, 340, 395, 396, 397, 398, 399, 400, 401, 402, 403, 458, 459, 460, 461, 462, 463, 464, 465, 466 of SEQ ID NO: 35.

Embodiment 33 is the PHI-4 polypeptide of any one of embodiments 27-32, wherein the protease sensitive region residue is selected from about amino acid residues between 305 (Lys) and 316 (Lys) and 500 (Arg) and 535 (Lys) relative to SEQ ID NO: 35.

Embodiment 34 is the PHI-4 polypeptide of embodiment 27, wherein the protease is trypsin.

Embodiment 35 is the PHI-4 polypeptide of any one of embodiments 27-34, wherein the PHI-4 polypeptide has at least 80% sequence identity to SEQ ID NO: 35.

Embodiment 36 is the PHI-4 polypeptide of any one of embodiments 27-34, wherein the PHI-4 polypeptide has at least 90% sequence identity to SEQ ID NO: 35.

Embodiment 37 is the PHI-4 polypeptide of any one of embodiments 27-34, wherein the PHI-4 polypeptide has at least 95% sequence identity to SEQ ID NO: 35.

Embodiment 38 is the PHI-4 polypeptide of any one of embodiments 27-34, wherein the PHI-4 polypeptide has at least 97% sequence identity to SEQ ID NO: 35.

Embodiment 39 is a PHI-4 polypeptide, comprising an amino acid sequence of the formula,

(SEQ ID NO: 3)                 5                   10                  15 Met Xaa Ser Ala Ala Asn Ala Gly Xaa Leu Gly Asn Leu Xaa Gly                 20                  25                  30 Xaa Thr Ser Xaa Gly Met Xaa Tyr Xaa Val Asn Gly Leu Tyr Ala                 35                  40                  45 Ser Pro Glu Ser Leu Xaa Gly Gln Pro Leu Phe Xaa Xaa Gly Gly                 50                  55                  60 Xaa Leu Asp Ser Xaa Xaa Ile Glu Gly Xaa Xaa Xaa Xaa Phe Pro                 65                  70                  75 Xaa Ser Met His Val His Thr Tyr Phe His Ser Asp Xaa Xaa Gln                 80                  85                  90 Xaa Val Ser Xaa Xaa Ile Xaa Xaa Xaa Arg Xaa Xaa Xaa Ser Xaa                 95                  100                 105 His Val Gly Xaa Ser Gly Xaa Xaa Xaa Leu Phe Ser Xaa Ser Xaa                 110                 115                 120 Ser Val Asp Xaa Thr Thr Xaa Xaa Gln Gln Leu Xaa Glu Ile Thr                 125                 130                 135 Xaa Ser Ser Thr Arg Glu Xaa His Val Leu Trp Tyr Ile Ser Leu                 140                 145                 150 Pro Gly Ala Ala Thr Leu Xaa Ser Met Leu Xaa Xaa Xaa Xaa Xaa                 155                 160                 165 Xaa Asp Xaa Xaa Xaa Pro Asn Met Xaa Ala Met Xaa Leu Phe Xaa                 170                 175                 180 Xaa Xaa Gly Pro Xaa Xaa Xaa Xaa Xaa Ala Ala Val Gly Gly Arg                 185                 190                 195 Leu Xaa Xaa Xaa Xaa Ala Ser Lys Xaa Leu Xaa Met Xaa Ser Ser                 200                 205                 210 Xaa Ser Leu Ser Thr Thr Xaa Xaa Xaa Ser Xaa Xaa Ala Xaa Xaa                 215                 220                 225 Gly Glu Ile Xaa Ile Xaa His Gly Ser Xaa Met Glu Lys Gln Val                 230                 235                 240 Asn Ser Phe Xaa Xaa Xaa Ser Thr Ile Arg Xaa Thr Ala Thr Gly                 245                 250                 255 Gly Lys Pro Gly Xaa Thr Xaa Arg Ile Leu His Gly Pro Asp Ser                 260                 265                 270 Xaa Xaa Ala Phe Ser Xaa Trp Ala Xaa Ser Leu Leu Xaa Tyr Ala                 275                 280                 285 Thr Leu Met Asp Phe Xaa Thr Xaa Ser Leu Xaa Xaa Ile Xaa Ala                 290                 295                 300 Leu Xaa Asp Xaa Pro Xaa Xaa Xaa Xaa Glu Xaa Xaa Xaa Ala Xaa                 305                 310                 315 Pro Xaa Xaa Met Xaa Xaa Ser Gln Xaa Ser Ile Pro Xaa Val Asp                 320                 325                 330 Xaa Val Leu Leu Met Asp Ala Arg Pro Pro Met Val Xaa Ala Gly                 335                 340                 345 Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa                 350                 355                 360 Xaa Ser Thr Ser Xaa Xaa Tyr Lys Xaa Xaa Gly Gln Phe Xaa Gln                 365                 370                 375 Arg Xaa His Xaa Ser Val Ala Asp Gly His Xaa Pro Ile Xaa Xaa                 380                 385                 390 Asp Leu Phe Asp Xaa Gly Xaa Xaa Xaa Xaa Pro Val Gly Xaa Gln                 395                 400                 405 Xaa Val Trp Asp Xaa Xaa Xaa Xaa Gly Lys Xaa Xaa Xaa Tyr Xaa                 410                 415                 420 Cys Trp Arg Xaa Xaa Xaa Xaa Gln Gly Tyr Xaa Xaa Xaa Gly Asp                 425                 430                 435 Val Xaa Met Leu Ala Xaa Ser Gly Tyr Asn Pro Pro Asn Leu Pro                 440                 445                 450 Xaa Xaa Xaa Cys Xaa His Xaa Ser Leu Xaa Ala Xaa Xaa Xaa Thr                 455                 460                 465 Leu Xaa Xaa Xaa Xaa Trp Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa                 470                 475                 480 Xaa Val Ser Leu Trp Xaa Pro Gly Ala Ala Gly Ala Val Ala Ser                 485                 490                 495 Ser Cys Phe Ala Gly Val Pro Asn Tyr Asn Asn Pro Pro Asn Ser                 500                 505                 510 Gly Xaa Ile Xaa Xaa Leu Xaa Gly Ser Ile Ala Cys Val Xaa Thr                 515                 520                 525 Ser Ala Ile Ala Ser Met Xaa Xaa Met Xaa Ser Met Leu Ser Xaa                 530                 535 His Xaa Gly Met Glu Ala Met Met Ser Lys Leu, wherein Xaa at position 2 is Ala or Arg; Xaa at position 9 is Gln, Lys or Glu; Xaa at position 14 is Pro or Ala; Xaa at position 16 is Val or Asp; Xaa at position 19 is Met or Leu; Xaa at position 22 is Gly or Ser; Xaa at position 24 is Asp, Asn or Gln; Xaa at position 36 is Leu or Met; Xaa at position 42 is Asp, Asn or Gln; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu, Asp, Asn or Gly; Xaa at position 50 is Ile or Val; Xaa at position 51 is Glu or Gln; Xaa at position 55 is Arg or Lys; Xaa at position 56 is Ser or Thr; Xaa at position 57 is Tyr or Phe; Xaa at position 58 is Thr or Ser; Xaa at position 61 is Arg, Lys or Glu; Xaa at position 73 is Phe or Tyr; Xaa at position 74 is Lys, Glu, Gly, Arg, Met, Leu, His or Asp; Xaa at position 76 is Asp or Gln; Xaa at position 79 is Lys or Glu; Xaa at position 80 is Glu or Ser; Xaa at position 82 is Glu, Ile, Leu, Tyr or Gln; Xaa at position 83 is Glu or Gln; Xaa at position 84 is Tyr or Phe; Xaa at position 86 is Glu or Gln; Xaa at position 87 is Lys or Gln; Xaa at position 88 is Met, Ile or Leu; Xaa at position 90 is Gln or Glu; Xaa at position 94 is Val or Ile; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln, Gly or Ser; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile, Met, Phe, Cys, Val or Asn; Xaa at position 103 is Ala or Gly; Xaa at position 105 is Leu or Ile; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp, Asn, Glu, Cys, Ala or Arg; Xaa at position 112 is Thr or Ser; Xaa at position 113 is Asp, Glu or Met; Xaa at position 117 is Thr or Ser; Xaa at position 121 is Tyr or Phe; Xaa at position 127 is Ala or Thr; Xaa at position 142 is Arg or Glu; Xaa at position 146 is Arg or Gln; Xaa at position 147 is Arg, Glu or Gln; Xaa at position 148 is Asp, Phe, Pro, Val, Glu, His, Trp, Ala, Arg, Leu, Ser, Gln or Gly; Xaa at position 149 is Phe or Val; Xaa at position 150 is Arg, Gln or Glu; Xaa at position 151 is Asp, Ser, Ala, Asn, Trp, Val, Gln, Cys, Met, Leu, Arg or Glu; Xaa at position 153 is Leu or Ile; Xaa at position 154 is Asn or Asp; Xaa at position 155 is Asn or Lys; Xaa at position 159 is Pro or Asp; Xaa at position 162 is Glu, Asp or Gln; Xaa at position 165 is Lys, Glu, Gln, Pro, Thr, Ala, Leu, Gly, Asp, Val, His, Ile, Met, Trp, Phe, Tyr or Arg; Xaa at position 166 is Arg or Gln; Xaa at position 167 is Tyr, Trp or Cys; Xaa at position 170 is Tyr or His; Xaa at position 171 is Tyr or Phe; Xaa at position 172 is Ile, Leu or Val; Xaa at position 173 is Ser or Ala; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 183 is Tyr or Val; Xaa at position 184 is Ser or Thr; Xaa at position 185 is Ala or Ser; Xaa at position 189 is Thr, Lys or Ile; Xaa at position 191 is Lys or Gln; Xaa at position 193 is Asp or Asn; Xaa at position 196 is Gln, Lys, Asn, Asp, Glu, Ala, Ile or Arg; Xaa at position 202 is Ala or Val; Xaa at position 203 is Glu, Thr or His; Xaa at position 204 is Met or Ala; Xaa at position 206 is Tyr or Phe; Xaa at position 207 is Lys or Gln; Xaa at position 209 is Leu or Pro; Xaa at position 210 is Val or Ile; Xaa at position 214 is Lys, Ser or Gln; Xaa at position 216 is Glu, Gln, Phe, Val, Tyr or Arg; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser, Gln, Arg, Trp, Met, Ala, Phe, Ile, Leu, Cys or Asn; Xaa at position 229 is Arg or Glu; Xaa at position 230 is Ser or Glu; Xaa at position 231 is Asn or Ser; Xaa at position 236 is Leu or Pro; Xaa at position 245 is Met or Leu; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln, Lys or Glu; Xaa at position 257 is Gln, Ile, Glu, Cys, Ser, His, Trp or Met; Xaa at position 261 is Gln, Glu or Lys; Xaa at position 264 is Glu or Gln; Xaa at position 268 is Asp or Asn; Xaa at position 276 is Ser or Ala; Xaa at position 278 is Glu, Asn or Gln; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 282 is Pro or Gly; Xaa at position 284 is Trp or Arg; Xaa at position 287 is Ala or Cys; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr, Asp, Phe, Ser, Met, Arg, Trp, Ile, His, Asn, Cys, Gly or Ala; Xaa at position 291 is Glu or Gln; Xaa at position 292 is Arg or Gln; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 294 is Val or Ala; Xaa at position 296 is Leu or Ile; Xaa at position 297 is Glu or Gln; Xaa at position 298 is Asp or Gln; Xaa at position 300 is Phe or Tyr; Xaa at position 302 is Glu or Gln; Xaa at position 303 is Phe or Tyr; Xaa at position 305 is Lys or Gln; Xaa at position 306 is Gln or Lys; Xaa at position 309 is Gln, Lys or Glu; Xaa at position 313 is Lys, Gln or Arg; Xaa at position 316 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 331 is Glu, Asn or Gln; Xaa at position 333 is Ser, Arg, Gly, Lys, Val, Asn, Ala, His, Gln, Thr, Asp, Ile, Leu, Cys or Glu; Xaa at position 334 is Gly, Arg, Lys, Ile or Trp; Xaa at position 335 is Ser or Ala; Xaa at position 336 is Gly or Ala; Xaa at position 337 is Ala, Val or Gly; Xaa at position 338 is Ser, His, Val, Lys, Ala, Gly, Thr, Ile, Glu, Met, Arg, Pro, Asp, Asn or Leu; Xaa at position 339 is Glu, Asn, Gln, Ile, Pro, Met, Ser, Ala, Cys, Phe, Val, Leu, Asp, Trp, His or Arg; Xaa at position 341 is Leu or Val; Xaa at position 342 is Ala, Ser or Val; Xaa at position 343 is Val or Ile; Xaa at position 344 is Phe or Trp; Xaa at position 345 is Asn or His; Xaa at position 346 is Pro or Ala; Xaa at position 350 is Asn or Ser; Xaa at position 351 is Gly or Val; Xaa at position 354 is Met or Leu; Xaa at position 355 is Val, Ile or Leu; Xaa at position 359 is Gly or Ala; Xaa at position 362 is Asn or Ser; Xaa at position 364 is Ala or Ser; Xaa at position 371 is Ala, Gly or Thr; Xaa at position 374 is Phe or Ile; Xaa at position 375 is Lys or Arg; Xaa at position 380 is Leu or Gly; Xaa at position 382 is Val, Asp or Leu; Xaa at position 383 is Leu, Ile or Val; Xaa at position 384 is Lys, Ala or Gly; Xaa at position 385 is Ala or Gly; Xaa at position 389 is Trp or Tyr; Xaa at position 391 is Arg, Leu, Glu, Gln or Asp; Xaa at position 395 is Asp or Cys; Xaa at position 396 is Ala, Leu, Lys, Asn, Gly, Ile, Met, Arg, Tyr, Gln or His; Xaa at position 397 is Gly, Arg or Ala; Xaa at position 398 is Ser, Gln or Cys; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val, Arg, Ile, Asn, Phe, Thr, Ala, Asp, Met, Gln or Glu; Xaa at position 402 is Lys, Phe, His, Arg, Trp, Gly, Asn, Leu, Tyr, Thr, Val, Met, Pro or Ala; Xaa at position 403 is Asp, Tyr, Trp, Phe or Glu; Xaa at position 405 is Ala or Ser; Xaa at position 409 is Ala or Pro; Xaa at position 410 is Ile or Val; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg, Glu or Gln; Xaa at position 417 is Ala, Ser or Cys; Xaa at position 418 is Leu or Met; Xaa at position 422 is Met or Val; Xaa at position 426 is Thr or Ser; Xaa at position 436 is Asp or Lys; Xaa at position 437 is Tyr or Val; Xaa at position 438 is Val or Arg; Xaa at position 440 is Val or Leu; Xaa at position 442 is Gln, Lys or Glu; Xaa at position 445 is Cys, Leu or Thr; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu, Ile, Gly, Pro, Leu, Phe, Trp or Thr; Xaa at position 448 is Val or Ala; Xaa at position 449 is Gln or Glu; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 453 is Asn or Asp; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp, Gln, Gly, His, Asp, Leu, Thr, Pro or Asn; Xaa at position 455 is Val or Ile; Xaa at position 457 is Trp or Asn; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile, Thr, Ser, His, Cys, Tyr, Pro, Asn, Ala, Arg or Glu; Xaa at position 460 is Gly or Ala; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala, Ser or Gly; Xaa at position 464 is Arg, Gly, His, Gln, Thr or Phe; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly, Arg, Thr, His, Cys, Trp, Phe or Leu; Xaa at position 466 is Asp or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 497 is Asp or Gln; Xaa at position 499 is Glu or Gln; Xaa at position 500 is Arg, Gln or Lys; Xaa at position 502 is Arg, Glu or Gln; Xaa at position 509 is Lys, Gln or Glu; Xaa at position 517 is Gln, Cys, Asn, Val or Pro; Xaa at position 518 is Glu or Gln; Xaa at position 520 is Lys, Gln or Glu; Xaa at position 525 is Gln or Lys; and Xaa at position 527 is Gln, Lys, Pro, Cys, Glu, Ser, His, Phe or Trp; wherein one or more amino acid(s) designated by Xaa in SEQ ID NO: 3 is an amino acid different from the corresponding amino acid of SEQ ID NO: 35 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35.

Embodiment 40 is a PHI-4 polypeptide, comprising an amino acid sequence of the formula

(SEQ ID NO: 4)                 5                   10                  15 Met Xaa Ser Ala Ala Asn Ala Gly Gln Leu Gly Asn Leu Pro Gly                 20                  25                  30 Val Thr Ser Met Gly Met Gly Tyr Xaa Val Asn Gly Leu Tyr Ala                 35                  40                  45 Ser Pro Glu Ser Leu Leu Gly Gln Pro Leu Phe Xaa Xaa Gly Gly                 50                  55                  60 Xaa Leu Asp Ser Ile Glu Ile Glu Gly Arg Ser Tyr Thr Phe Pro                 65                  70                  75 Arg Ser Met His Val His Thr Tyr Phe His Ser Asp Phe Xaa Gln                 80                  85                  90 Asp Val Ser Xaa Glu Ile Xaa Glu Tyr Arg Glu Lys Met Ser Gln                 95                  100                 105 His Val Gly Val Ser Gly Xaa Xaa Xaa Leu Phe Ser Ala Ser Leu                 110                 115                 120 Ser Val Asp Xaa Thr Thr Thr Asp Gln Gln Leu Thr Glu Ile Thr                 125                 130                 135 Tyr Ser Ser Thr Arg Glu Ala His Val Leu Trp Tyr Ile Ser Leu                 140                 145                 150 Pro Gly Ala Ala Thr Leu Arg Ser Met Leu Arg Xaa Xaa Phe Xaa                 155                 160                 165 Xaa Asp Xaa Asn Asn Pro Asn Met Pro Ala Met Xaa Leu Phe Xaa                 170                 175                 180 Xaa Tyr Gly Pro Tyr Xaa Ile Ser Xaa Ala Ala Val Gly Gly Arg                 185                 190                 195 Leu Xaa Tyr Ser Ala Ala Ser Lys Thr Leu Lys Met Asp Ser Ser                 200                 205                 210 Xaa Ser Leu Ser Thr Thr Ala Xaa Met Ser Xaa Lys Ala Leu Val                 215                 220                 225 Gly Glu Ile Lys Ile Xaa His Gly Ser Xaa Met Glu Lys Gln Val                 230                 235                 240 Asn Ser Phe Arg Ser Asn Ser Thr Ile Arg Leu Thr Ala Thr Gly                 245                 250                 255 Gly Lys Pro Gly Met Thr Xaa Arg Ile Leu His Gly Pro Asp Ser                 260                 265                 270 Xaa Xaa Ala Phe Ser Xaa Trp Ala Glu Ser Leu Leu Asp Tyr Ala                 275                 280                 285 Thr Leu Met Asp Phe Ser Thr Xaa Ser Leu Xaa Pro Ile Trp Ala                 290                 295                 300 Leu Ala Asp Xaa Pro Glu Arg Xaa Val Glu Leu Glu Asp Ala Phe                 305                 310                 315 Pro Glu Phe Met Lys Gln Ser Gln Gln Ser Ile Pro Xaa Val Asp                 320                 325                 330 Lys Val Leu Leu Met Asp Ala Arg Pro Pro Met Val Xaa Ala Gly                 335                 340                 345 Glu Asp Xaa Xaa Ser Xaa Ala Xaa Xaa Asp Leu Ala Xaa Phe Asn                 350                 355                 360 Xaa Ser Thr Ser Asn Gly Tyr Lys Met Xaa Gly Gln Phe Xaa Gln                 365                 370                 375 Arg Asn His Ala Ser Val Ala Asp Gly His Ala Pro Ile Phe Lys                 380                 385                 390 Asp Leu Phe Asp Leu Gly Val Leu Lys Ala Pro Val Gly Trp Gln                 395                 400                 405 Xaa Val Trp Asp Asp Xaa Gly Ser Gly Lys Xaa Xaa Xaa Tyr Ala                 410                 415                 420 Cys Trp Arg Ala Ile Xaa Xaa Gln Gly Tyr Xaa Xaa Xaa Gly Asp                 425                 430                 435 Val Met Met Leu Ala Xaa Ser Gly Tyr Asn Pro Pro Asn Leu Pro                 440                 445                 450 Asp Tyr Val Cys Xaa His Gln Ser Leu Cys Ala Xaa Val Gln Thr                 455                 460                 465 Leu Xaa Asn Xaa Xaa Trp Trp Asp Xaa Gly Xaa Xaa Xaa Xaa Xaa                 470                 475                 480 Asp Val Ser Leu Trp Xaa Pro Gly Ala Ala Gly Ala Val Ala Ser                 485                 490                 495 Ser Cys Phe Ala Gly Val Pro Asn Tyr Asn Asn Pro Pro Asn Ser                 500                 505                 510 Gly Asp Ile Glu Xaa Leu Arg Gly Ser Ile Ala Cys Val Xaa Thr                 515                 520                 525 Ser Ala Ile Ala Ser Met Gln Glu Met Xaa Ser Met Leu Ser Gln                 530                 535 His Xaa Gly Met Glu Ala Met Met Ser Lys Leu, wherein Xaa at position 2 is Ala or Arg; Xaa at position 24 is Asp or Asn; Xaa at position 42 is Asp or Asn; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu or Asn; Xaa at position 74 is Lys, Glu or Gly; Xaa at position 79 is Lys or Glu; Xaa at position 82 is Glu, Ile, Leu or Tyr; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln or Gly; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile or Met; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp or Asn; Xaa at position 147 is Arg or Glu; Xaa at position 148 is Asp, Phe or Pro; Xaa at position 150 is Arg or Gln; Xaa at position 151 is Asp, Ser, Ala or Asn; Xaa at position 153 is Leu or Ile; Xaa at position 162 is Glu or Gln; Xaa at position 165 is Lys, Glu or Gln; Xaa at position 166 is Arg or Gln; Xaa at position 171 is Tyr or Phe; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 196 is Gln, Lys, Asn or Asp; Xaa at position 203 is Glu, Thr or His; Xaa at position 206 is Tyr or Phe; Xaa at position 216 is Glu or Gln; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser or Gln; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln or Lys; Xaa at position 257 is Gln or Ile; Xaa at position 261 is Gln or Glu; Xaa at position 278 is Glu or Asn; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr or Asp; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 313 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 333 is Ser, Gly, Lys, Val or Asn; Xaa at position 334 is Gly, Arg, Lys or Ile; Xaa at position 336 is Gly or Ala; Xaa at position 338 is Ser, His, Val, Lys or Ala; Xaa at position 339 is Glu, Asn, Ile or Pro; Xaa at position 343 is Val or Ile; Xaa at position 346 is Pro or Ala; Xaa at position 355 is Val or Ile; Xaa at position 359 is Gly or Ala; Xaa at position 391 is Arg, Glu or Gln; Xaa at position 396 is Ala, Leu, Lys, Asn or Gly; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val or Arg; Xaa at position 402 is Lys, Phe, His, Arg, Gly, Trp, Thr, Asn, Tyr or Met; Xaa at position 403 is Asp or Tyr; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg or Glu; Xaa at position 417 is Ala or Ser; Xaa at position 418 is Leu or Met; Xaa at position 426 is Thr or Ser; Xaa at position 440 is Val or Leu; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu or Ile; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp or Gln; Xaa at position 455 is Val or Ile; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile or Tyr; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala or Ser; Xaa at position 464 is Arg, Gly or His; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 500 is Arg or Gln; Xaa at position 509 is Lys or Gln; Xaa at position 520 is Lys, Gln or Glu; and Xaa at position 527 is Gln, Lys, Pro, Cys or Glu; wherein one or more amino acid(s) designated by Xaa in SEQ ID NO: 4 is an amino acid different from the corresponding amino acid of SEQ ID NO: 35 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35.

Embodiment 41 is the PHI-4 polypeptide of embodiment 39 or 40, further comprising one or more amino acid substitutions at position 86, 359, 464, 465, 466, 467, 468, 499 or 517 of SEQ ID NO: 3 or SEQ ID NO: 4.

Embodiment 42 is the PHI-4 polypeptide of embodiment 41, wherein the amino acid at position 86 is Glu or Thr; the amino acid at position 359 is Gly or Ala; the amino acid at position 464 is Arg, Ala, Lys, Asp or Asn; the amino acid at position 465 is Lys or Met, the amino acid at position 467 is Val, Ala, Leu or Thr; the amino acid at position 468 is Ser or Leu; the amino acid at position 499 is Glu or Ala, or the amino acid at position 517 is Glu or Arg.

Embodiment 43 is the PHI-4 polypeptide of embodiment 39-42, further comprising one or more conservative amino acid substitution, insertion of one or more amino acids, deletion of one or more amino acids, and combinations thereof.

Embodiment 44 is the PHI-4 polypeptide of any one of embodiments 39-43, wherein the insecticidal activity is increased about 1.5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 45 is the PHI-4 polypeptide of any one of embodiments 39-43, wherein the insecticidal activity is increased about 2 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 46 is the PHI-4 polypeptide of any one of embodiments 39-43, wherein the insecticidal activity is increased about 2.5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 47 is the PHI-4 polypeptide of any one of embodiments 39-43, wherein the insecticidal activity is increased about 3 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 48 is the PHI-4 polypeptide of any one of embodiments 39-43, wherein the insecticidal activity is increased about 5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 49 is the PHI-4 polypeptide of any one of embodiments 39-48, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is against Western Corn Root Worm (WCRW) larvae.

Embodiment 50 is the PHI-4 polypeptide of any one of embodiments 39-49, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as a Mean FAE Index.

Embodiment 51 is the PHI-4 polypeptide of any one of embodiments 39-49, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as an EC50 value.

Embodiment 52 is the PHI-4 polypeptide of any one of embodiments 39-49, wherein the improved activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as a Mean Deviation Score.

Embodiment 53 is the PHI-4 polypeptide of any one of embodiments 39-49, having 1 to 54 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or 4.

Embodiment 54 is the PHI-4 polypeptide of any one of embodiments 39-49, having 1 to 27 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or 4.

Embodiment 55 is the PHI-4 polypeptide of any one of embodiments 39-49, having 1 to 20 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or 4.

Embodiment 56 is the PHI-4 polypeptide of any one of embodiments 39-49, having 1 to 15 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or 4.

Embodiment 57 is the PHI-4 polypeptide of any one of embodiments 1-56, wherein 1-25 amino acids are deleted from the N-terminus of the PHI-4 polypeptide and/or C-terminus of the PHI-4 polypeptide.

Embodiment 58 is the PHI-4 polypeptide of any one of embodiments 1-53, wherein 1-20 amino acids are deleted from the C-terminus of the PHI-4 polypeptide.

Embodiment 59 is a polynucleotide encoding a PHI-4 polypeptide, wherein the PHI-4 polypeptide has improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 60 is the polynucleotide of embodiment 59, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 1.5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 61 is the polynucleotide of embodiment 59, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 2 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 62 is the polynucleotide of embodiment 59, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 2.5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 63 is the polynucleotide of embodiment 59, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 3 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 64 is the polynucleotide of embodiment 59, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 65 is the polynucleotide of any one of embodiments 59-64, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is against Western Corn Root Worm (WCRW) larvae.

Embodiment 66 is the polynucleotide of any one of embodiments 59-64, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as a Mean FAE Index.

Embodiment 67 is the polynucleotide of any one of embodiments 59-64, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as an EC50 value.

Embodiment 68 is the polynucleotide of any one of embodiments 59-64, wherein the improved activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as a Mean Deviation Score.

Embodiment 69 is the polynucleotide of any one of embodiments 59-68, wherein the PHI-4 polypeptide comprises one or more amino acid substitutions compared to the native amino acid at position 40, 42, 43, 46, 52, 97, 98, 99, 145, 150, 151, 153, 163, 171, 172, 182, 196, 206, 210, 216, 220, 278, 283, 289, 293, 328, 333, 334, 336, 338, 339, 342, 346, 354, 355, 370, 389, 393, 396, 401, 402, 403, 410, 412, 416, 417, 426, 442, 447, 452, 454, 455, 457, 461, 462, 500, 509, 520 or 527 of SEQ ID NO: 35.

Embodiment 70 is the polynucleotide of embodiment 69, wherein the PHI-4 polypeptide further comprises one or more amino acid substitutions compared to the native amino acid at position 86, 359, 464, 465, 466, 467, 468, 499 or 517 of SEQ ID NO: 35.

Embodiment 71 is the polynucleotide of embodiment 70, wherein the amino acid at position 86 is Glu or Thr; the amino acid at position 359 is Gly or Ala; the amino acid at position 464 is Arg, Ala, Lys, Asp or Asn; the amino acid at position 465 is Lys or Met, the amino acid at position 467 is Val, Ala, Leu or Thr; the amino acid at position 468 is Ser or Leu; the amino acid at position 499 is Glu or Ala, or the amino acid at position 517 is Glu or Arg.

Embodiment 72 is the polynucleotide of any one of embodiments 69-71, wherein the amino acid at position 40 is Leu or Ile; the amino acid at position 42 is Asp or Asn; the amino acid at position 43 is Phe or Glu; the amino acid at position 46 is Glu or Asn; the amino acid at position 52 is Ile or Val; the amino acid at position 97 is Arg, Asp, Glu or Asn; the amino acid at position 98 is Tyr or Phe; the amino acid at position 99 is Lys or Leu; the amino acid at position 145 is Leu or Val; the amino acid at position 150 is Arg or Gln; the amino acid at position 151 is Asp or Ser; the amino acid at position 153 is Leu or Ile; the amino acid at position 163 is Leu or Val; the amino acid at position 171 is Tyr or Phe; the amino acid at position 172 is Ile or Leu; the amino acid at position 182 is Asp or Gln; the amino acid at position 196 is Gln or Asn; the amino acid at position 206 is Tyr or Phe; the amino acid at position 210 is Val or Ile; the amino acid at position 216 is Glu or Gln; the amino acid at position 220 is Glu, Gln, His or Asp; the amino acid at position 278 is Glu or Asn; the amino acid at position 283 is Ile or Val; the amino acid at position 289 is Lys, Gln or Leu; the amino acid at position 293 is Arg, Gln or Glu; the amino acid at position 328 is Lys or Glu; the amino acid at position 333 is Ser, Lys or Val; the amino acid at position 334 is Gly, Lys or Arg; the amino acid at position 336 is Gly or Ala; the amino acid at position 338 is Ser or Val; the amino acid at position 339 is Glu, Asn or Gln; the amino acid at position 342 is Ala or Ser; the amino acid at position 346 is Pro or Ala; the amino acid at position 354 is Met or Leu; the amino acid at position 355 is Val or Ile; the amino acid at position 370 is His or Arg; the amino acid at position 389 is Trp or Leu; the amino acid at position 393 is Trp or Leu; the amino acid at position 396 is Ala, Leu, Lys, Thr or Gly; the amino acid at position 401 is Ser, His, Gly, Lys or Pro; the amino acid at position 402 is Lys, His, Gly or Trp; the amino acid at position 403 is Asp or Tyr; the amino acid at position 410 is Ile or Val; the amino acid at position 412 is Pro or Ala; the amino acid at position 416 is Arg or Glu; the amino acid at position 417 is Ala or Ser; the amino acid at position 426 is Thr or Ser; the amino acid at position 442 is Gln or Glu; the amino acid at position 447 is Asp or Lys; the amino acid at position 452 is Gln or Lys; the amino acid at position 454 is Arg or Gln; the amino acid at position 455 is Val or Ile; the amino acid at position 457 is Trp or Asn; the amino acid at position 461 is Thr or Ser; the amino acid at position 462 is Gly or Ala; the amino acid at position 500 is Arg or Gln; the amino acid at position 509 is Lys or Gln; the amino acid at position 520 is Lys, Glu or Gln; and the amino acid at position 527 is Gln or Lys.

Embodiment 73 is the polynucleotide of any one of embodiments 59-69 and 72, wherein the PHI-4 polypeptide has 1 to 54 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 74 is the polynucleotide of any one of embodiments 59-69 and 72, wherein the PHI-4 polypeptide has 1 to 27 amino acid substitutions compared to SEQ ID NO: 2.

Embodiment 75 is the polynucleotide of any one of embodiments 59-69 and 72, wherein the PHI-4 polypeptide has 1 to 20 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 76 is the polynucleotide of any one of embodiments 59-69 and 72, wherein the PHI-4 polypeptide has 1 to 15 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 77 is the polynucleotide of embodiments 70 or 71, wherein the PHI-4 polypeptide has 2 to 54 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 78 is the polynucleotide of embodiments 70 or 71, wherein the PHI-4 polypeptide has 2 to 27 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 79 is the polynucleotide of embodiments 70 or 71, wherein the PHI-4 polypeptide has 2 to 20 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 80 is the polynucleotide of embodiments 70 or 71, wherein the PHI-4 polypeptide has 2 to 15 amino acid substitutions compared to SEQ ID NO: 35.

Embodiment 81 is the polynucleotide of any one of embodiments 59-80, wherein the PHI-4 polypeptide has at least 80% identity to SEQ ID NO: 35.

Embodiment 82 is the polynucleotide of any one of embodiments 59-80, wherein the PHI-4 polypeptide has at least 90% identity to SEQ ID NO: 35.

Embodiment 83 is the polynucleotide of any one of embodiments 59-80, wherein the PHI-4 polypeptide has at least 95% identity to SEQ ID NO: 35.

Embodiment 84 is the polynucleotide of any one of embodiments 59-80, wherein the PHI-4 polypeptide has at least 97% identity to SEQ ID NO: 35.

Embodiment 85 is a polynucleotide encoding a PHI-4 polypeptide, wherein the PHI-4 polypeptide has at least one amino acid substitution at a residue relative to SEQ ID NO: 35 in structural domain selected from:

-   -   a hydrophilic residue;     -   a residue in a membrane insertion initiation loop;     -   a residue in a receptor binding loop; and     -   a residue in a protease sensitive region,

wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35.

Embodiment 86 is the polynucleotide of embodiment 85, wherein the hydrophilic residues are Asp, Glu, Lys, Arg, His, Ser, Thr, Tyr, Trp, Asn, Gln, and Cys. Embodiment 87 is the polynucleotide of embodiment 85 or 86, wherein the membrane insertion loops are between about amino acid at position 92 (Val) and 101 (Ala) and at position 211 (Gly) and 220 (Glu) relative to SEQ ID NO: 35.

Embodiment 88 is the polynucleotide of embodiment 87, wherein the PHI-4 polypeptide has one or more amino substitution compared to the native amino acid at position 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 211, 212, 213, 214, 215, 216, 217, 218, 219, and 220 of SEQ ID NO: 35.

Embodiment 89 is the polynucleotide of any one of embodiments 85, 86, 87 or 88, wherein the receptor binding loops are between about amino acid 332 (Asp) and 340 (Asp), 395 (Asp) and 403 (Asp), 458 (Asp) and 466 (Asp) relative to SEQ ID NO: 35.

Embodiment 90 is the polynucleotide of embodiment 89, wherein the PHI-4 polypeptide has one or more amino substitution compared to the native amino acid at position 332, 333, 334, 335, 336, 337, 338, 339, 340, 395, 396, 397, 398, 399, 400, 401, 402, 403, 458, 459, 460, 461, 462, 463, 464, 465, 466 of SEQ ID NO: 35.

Embodiment 91 is the polynucleotide of any one of embodiments 85, 86, 87, 88, 89 or 90, wherein the protease sensitive region residue is selected from about amino acid residues between position 305 (Lys) and position 316 (Lys) and/or position 500 (Arg) and position 535 (Lys) relative to SEQ ID NO: 35.

Embodiment 92 is the polynucleotide of any one of embodiments 85, 86, 87, 88, 89, 90 or 91, wherein the protease is trypsin.

Embodiment 93 is the polynucleotide of any one of embodiments 85-92, wherein the PHI-4 polypeptide has at least 80% sequence identity to SEQ ID NO: 35.

Embodiment 94 is the polynucleotide of any one of embodiments 85-92, wherein the PHI-4 polypeptide has at least 90% sequence identity to SEQ ID NO: 35.

Embodiment 95 is the polynucleotide of any one of embodiments 85-92, wherein the PHI-4 polypeptide has at least 95% sequence identity to SEQ ID NO: 35.

Embodiment 96 is the polynucleotide of any one of embodiments 85-92, wherein the PHI-4 polypeptide has at least 97% sequence identity to SEQ ID NO: 35.

Embodiment 97 is a polynucleotide encoding a PHI-4 polypeptide, wherein the PHI-4 polypeptide comprises an amino acid sequence of the formula,

(SEQ ID NO: 3)                 5                   10                  15 Met Xaa Ser Ala Ala Asn Ala Gly Xaa Leu Gly Asn Leu Xaa Gly                 20                  25                  30 Xaa Thr Ser Xaa Gly Met Xaa Tyr Xaa Val Asn Gly Leu Tyr Ala                 35                  40                  45 Ser Pro Glu Ser Leu Xaa Gly Gln Pro Leu Phe Xaa Xaa Gly Gly                 50                  55                  60 Xaa Leu Asp Ser Xaa Xaa Ile Glu Gly Xaa Xaa Xaa Xaa Phe Pro                 65                  70                  75 Xaa Ser Met His Val His Thr Tyr Phe His Ser Asp Xaa Xaa Gln                 80                  85                  90 Xaa Val Ser Xaa Xaa Ile Xaa Xaa Xaa Arg Xaa Xaa Xaa Ser Xaa                 95                  100                 105 His Val Gly Xaa Ser Gly Xaa Xaa Xaa Leu Phe Ser Xaa Ser Xaa                 110                 115                 120 Ser Val Asp Xaa Thr Thr Xaa Xaa Gln Gln Leu Xaa Glu Ile Thr                 125                 130                 135 Xaa Ser Ser Thr Arg Glu Xaa His Val Leu Trp Tyr Ile Ser Leu                 140                 145                 150 Pro Gly Ala Ala Thr Leu Xaa Ser Met Leu Xaa Xaa Xaa Xaa Xaa                 155                 160                 165 Xaa Asp Xaa Xaa Xaa Pro Asn Met Xaa Ala Met Xaa Leu Phe Xaa                 170                 175                 180 Xaa Xaa Gly Pro Xaa Xaa Xaa Xaa Xaa Ala Ala Val Gly Gly Arg                 185                 190                 195 Leu Xaa Xaa Xaa Xaa Ala Ser Lys Xaa Leu Xaa Met Xaa Ser Ser                 200                 205                 210 Xaa Ser Leu Ser Thr Thr Xaa Xaa Xaa Ser Xaa Xaa Ala Xaa Xaa                 215                 220                 225 Gly Glu Ile Xaa Ile Xaa His Gly Ser Xaa Met Glu Lys Gln Val                 230                 235                 240 Asn Ser Phe Xaa Xaa Xaa Ser Thr Ile Arg Xaa Thr Ala Thr Gly                 245                 250                 255 Gly Lys Pro Gly Xaa Thr Xaa Arg Ile Leu His Gly Pro Asp Ser                 260                 265                 270 Xaa Xaa Ala Phe Ser Xaa Trp Ala Xaa Ser Leu Leu Xaa Tyr Ala                 275                 280                 285 Thr Leu Met Asp Phe Xaa Thr Xaa Ser Leu Xaa Xaa Ile Xaa Ala                 290                 295                 300 Leu Xaa Asp Xaa Pro Xaa Xaa Xaa Xaa Glu Xaa Xaa Xaa Ala Xaa                 305                 310                 315 Pro Xaa Xaa Met Xaa Xaa Ser Gln Xaa Ser Ile Pro Xaa Val Asp                 320                 325                 330 Xaa Val Leu Leu Met Asp Ala Arg Pro Pro Met Val Xaa Ala Gly                 335                 340                 345 Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa                 350                 355                 360 Xaa Ser Thr Ser Xaa Xaa Tyr Lys Xaa Xaa Gly Gln Phe Xaa Gln                 365                 370                 375 Arg Xaa His Xaa Ser Val Ala Asp Gly His Xaa Pro Ile Xaa Xaa                 380                 385                 390 Asp Leu Phe Asp Xaa Gly Xaa Xaa Xaa Xaa Pro Val Gly Xaa Gln                 395                 400                 405 Xaa Val Trp Asp Xaa Xaa Xaa Xaa Gly Lys Xaa Xaa Xaa Tyr Xaa                 410                 415                 420 Cys Trp Arg Xaa Xaa Xaa Xaa Gln Gly Tyr Xaa Xaa Xaa Gly Asp                 425                 430                 435 Val Xaa Met Leu Ala Xaa Ser Gly Tyr Asn Pro Pro Asn Leu Pro                 440                 445                 450 Xaa Xaa Xaa Cys Xaa His Xaa Ser Leu Xaa Ala Xaa Xaa Xaa Thr                 455                 460                 465 Leu Xaa Xaa Xaa Xaa Trp Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa                 470                 475                 480 Xaa Val Ser Leu Trp Xaa Pro Gly Ala Ala Gly Ala Val Ala Ser                 485                 490                 495 Ser Cys Phe Ala Gly Val Pro Asn Tyr Asn Asn Pro Pro Asn Ser                 500                 505                 510 Gly Xaa Ile Xaa Xaa Leu Xaa Gly Ser Ile Ala Cys Val Xaa Thr                 515                 520                 525 Ser Ala Ile Ala Ser Met Xaa Xaa Met Xaa Ser Met Leu Ser Xaa                 530                 535 His Xaa Gly Met Glu Ala Met Met Ser Lys Leu, wherein Xaa at position 2 is Ala or Arg; Xaa at position 9 is Gln, Lys or Glu; Xaa at position 14 is Pro or Ala; Xaa at position 16 is Val or Asp; Xaa at position 19 is Met or Leu; Xaa at position 22 is Gly or Ser; Xaa at position 24 is Asp, Asn or Gln; Xaa at position 36 is Leu or Met; Xaa at position 42 is Asp, Asn or Gln; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu, Asp, Asn or Gly; Xaa at position 50 is Ile or Val; Xaa at position 51 is Glu or Gln; Xaa at position 55 is Arg or Lys; Xaa at position 56 is Ser or Thr; Xaa at position 57 is Tyr or Phe; Xaa at position 58 is Thr or Ser; Xaa at position 61 is Arg, Lys or Glu; Xaa at position 73 is Phe or Tyr; Xaa at position 74 is Lys, Glu, Gly, Arg, Met, Leu, His or Asp; Xaa at position 76 is Asp or Gln; Xaa at position 79 is Lys or Glu; Xaa at position 80 is Glu or Ser; Xaa at position 82 is Glu, Ile, Leu, Tyr or Gln; Xaa at position 83 is Glu or Gln; Xaa at position 84 is Tyr or Phe; Xaa at position 86 is Glu or Gln; Xaa at position 87 is Lys or Gln; Xaa at position 88 is Met, Ile or Leu; Xaa at position 90 is Gln or Glu; Xaa at position 94 is Val or Ile; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln, Gly or Ser; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile, Met, Phe, Cys, Val or Asn; Xaa at position 103 is Ala or Gly; Xaa at position 105 is Leu or Ile; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp, Asn, Glu, Cys, Ala or Arg; Xaa at position 112 is Thr or Ser; Xaa at position 113 is Asp, Glu or Met; Xaa at position 117 is Thr or Ser; Xaa at position 121 is Tyr or Phe; Xaa at position 127 is Ala or Thr; Xaa at position 142 is Arg or Glu; Xaa at position 146 is Arg or Gln; Xaa at position 147 is Arg, Glu or Gln; Xaa at position 148 is Asp, Phe, Pro, Val, Glu, His, Trp, Ala, Arg, Leu, Ser, Gln or Gly; Xaa at position 149 is Phe or Val; Xaa at position 150 is Arg, Gln or Glu; Xaa at position 151 is Asp, Ser, Ala, Asn, Trp, Val, Gln, Cys, Met, Leu, Arg or Glu; Xaa at position 153 is Leu or Ile; Xaa at position 154 is Asn or Asp; Xaa at position 155 is Asn or Lys; Xaa at position 159 is Pro or Asp; Xaa at position 162 is Glu, Asp or Gln; Xaa at position 165 is Lys, Glu, Gln, Pro, Thr, Ala, Leu, Gly, Asp, Val, His, Ile, Met, Trp, Phe, Tyr or Arg; Xaa at position 166 is Arg or Gln; Xaa at position 167 is Tyr, Trp or Cys; Xaa at position 170 is Tyr or His; Xaa at position 171 is Tyr or Phe; Xaa at position 172 is Ile, Leu or Val; Xaa at position 173 is Ser or Ala; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 183 is Tyr or Val; Xaa at position 184 is Ser or Thr; Xaa at position 185 is Ala or Ser; Xaa at position 189 is Thr, Lys or Ile; Xaa at position 191 is Lys or Gln; Xaa at position 193 is Asp or Asn; Xaa at position 196 is Gln, Lys, Asn, Asp, Glu, Ala, Ile or Arg; Xaa at position 202 is Ala or Val; Xaa at position 203 is Glu, Thr or His; Xaa at position 204 is Met or Ala; Xaa at position 206 is Tyr or Phe; Xaa at position 207 is Lys or Gln; Xaa at position 209 is Leu or Pro; Xaa at position 210 is Val or Ile; Xaa at position 214 is Lys, Ser or Gln; Xaa at position 216 is Glu, Gln, Phe, Val, Tyr or Arg; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser, Gln, Arg, Trp, Met, Ala, Phe, Ile, Leu, Cys or Asn; Xaa at position 229 is Arg or Glu; Xaa at position 230 is Ser or Glu; Xaa at position 231 is Asn or Ser; Xaa at position 236 is Leu or Pro; Xaa at position 245 is Met or Leu; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln, Lys or Glu; Xaa at position 257 is Gln, Ile, Glu, Cys, Ser, His, Trp or Met; Xaa at position 261 is Gln, Glu or Lys; Xaa at position 264 is Glu or Gln; Xaa at position 268 is Asp or Asn; Xaa at position 276 is Ser or Ala; Xaa at position 278 is Glu, Asn or Gln; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 282 is Pro or Gly; Xaa at position 284 is Trp or Arg; Xaa at position 287 is Ala or Cys; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr, Asp, Phe, Ser, Met, Arg, Trp, Ile, His, Asn, Cys, Gly or Ala; Xaa at position 291 is Glu or Gln; Xaa at position 292 is Arg or Gln; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 294 is Val or Ala; Xaa at position 296 is Leu or Ile; Xaa at position 297 is Glu or Gln; Xaa at position 298 is Asp or Gln; Xaa at position 300 is Phe or Tyr; Xaa at position 302 is Glu or Gln; Xaa at position 303 is Phe or Tyr; Xaa at position 305 is Lys or Gln; Xaa at position 306 is Gln or Lys; Xaa at position 309 is Gln, Lys or Glu; Xaa at position 313 is Lys, Gln or Arg; Xaa at position 316 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 331 is Glu, Asn or Gln; Xaa at position 333 is Ser, Arg, Gly, Lys, Val, Asn, Ala, His, Gln, Thr, Asp, Ile, Leu, Cys or Glu; Xaa at position 334 is Gly, Arg, Lys, Ile or Trp; Xaa at position 335 is Ser or Ala; Xaa at position 336 is Gly or Ala; Xaa at position 337 is Ala, Val or Gly; Xaa at position 338 is Ser, His, Val, Lys, Ala, Gly, Thr, Ile, Glu, Met, Arg, Pro, Asp, Asn or Leu; Xaa at position 339 is Glu, Asn, Gln, Ile, Pro, Met, Ser, Ala, Cys, Phe, Val, Leu, Asp, Trp, His or Arg; Xaa at position 341 is Leu or Val; Xaa at position 342 is Ala, Ser or Val; Xaa at position 343 is Val or Ile; Xaa at position 344 is Phe or Trp; Xaa at position 345 is Asn or His; Xaa at position 346 is Pro or Ala; Xaa at position 350 is Asn or Ser; Xaa at position 351 is Gly or Val; Xaa at position 354 is Met or Leu; Xaa at position 355 is Val, Ile or Leu; Xaa at position 359 is Gly or Ala; Xaa at position 362 is Asn or Ser; Xaa at position 364 is Ala or Ser; Xaa at position 371 is Ala, Gly or Thr; Xaa at position 374 is Phe or Ile; Xaa at position 375 is Lys or Arg; Xaa at position 380 is Leu or Gly; Xaa at position 382 is Val, Asp or Leu; Xaa at position 383 is Leu, Ile or Val; Xaa at position 384 is Lys, Ala or Gly; Xaa at position 385 is Ala or Gly; Xaa at position 389 is Trp or Tyr; Xaa at position 391 is Arg, Leu, Glu, Gln or Asp; Xaa at position 395 is Asp or Cys; Xaa at position 396 is Ala, Leu, Lys, Asn, Gly, Ile, Met, Arg, Tyr, Gln or His; Xaa at position 397 is Gly, Arg or Ala; Xaa at position 398 is Ser, Gln or Cys; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val, Arg, Ile, Asn, Phe, Thr, Ala, Asp, Met, Gln or Glu; Xaa at position 402 is Lys, Phe, His, Arg, Trp, Gly, Asn, Leu, Tyr, Thr, Val, Met, Pro or Ala; Xaa at position 403 is Asp, Tyr, Trp, Phe or Glu; Xaa at position 405 is Ala or Ser; Xaa at position 409 is Ala or Pro; Xaa at position 410 is Ile or Val; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg, Glu or Gln; Xaa at position 417 is Ala, Ser or Cys; Xaa at position 418 is Leu or Met; Xaa at position 422 is Met or Val; Xaa at position 426 is Thr or Ser; Xaa at position 436 is Asp or Lys; Xaa at position 437 is Tyr or Val; Xaa at position 438 is Val or Arg; Xaa at position 440 is Val or Leu; Xaa at position 442 is Gln, Lys or Glu; Xaa at position 445 is Cys, Leu or Thr; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu, Ile, Gly, Pro, Leu, Phe, Trp or Thr; Xaa at position 448 is Val or Ala; Xaa at position 449 is Gln or Glu; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 453 is Asn or Asp; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp, Gln, Gly, His, Asp, Leu, Thr, Pro or Asn; Xaa at position 455 is Val or Ile; Xaa at position 457 is Trp or Asn; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile, Thr, Ser, His, Cys, Tyr, Pro, Asn, Ala, Arg or Glu; Xaa at position 460 is Gly or Ala; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala, Ser or Gly; Xaa at position 464 is Arg, Gly, His, Gln, Thr or Phe; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly, Arg, Thr, His, Cys, Trp, Phe or Leu; Xaa at position 466 is Asp or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 497 is Asp or Gln; Xaa at position 499 is Glu or Gln; Xaa at position 500 is Arg, Gln or Lys; Xaa at position 502 is Arg, Glu or Gln; Xaa at position 509 is Lys, Gln or Glu; Xaa at position 517 is Gln, Cys, Asn, Val or Pro; Xaa at position 518 is Glu or Gln; Xaa at position 520 is Lys, Gln or Glu; Xaa at position 525 is Gln or Lys; and Xaa at position 527 is Gln, Lys, Pro, Cys, Glu, Ser, His, Phe or Trp; wherein one or more amino acid(s) designated by Xaa in SEQ ID NO: 3 is an amino acid different from the corresponding amino acid of SEQ ID NO: 35 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35.

Embodiment 98 is a polynucleotide encoding a PHI-4 polypeptide, wherein the PHI-4 polypeptide comprises an amino acid sequence of the formula,

(SEQ ID NO: 4)                 5                   10                  15 Met Xaa Ser Ala Ala Asn Ala Gly Gln Leu Gly Asn Leu Pro Gly                 20                  25                  30 Val Thr Ser Met Gly Met Gly Tyr Xaa Val Asn Gly Leu Tyr Ala                 35                  40                  45 Ser Pro Glu Ser Leu Leu Gly Gln Pro Leu Phe Xaa Xaa Gly Gly                 50                  55                  60 Xaa Leu Asp Ser Ile Glu Ile Glu Gly Arg Ser Tyr Thr Phe Pro                 65                  70                  75 Arg Ser Met His Val His Thr Tyr Phe His Ser Asp Phe Xaa Gln                 80                  85                  90 Asp Val Ser Xaa Glu Ile Xaa Glu Tyr Arg Glu Lys Met Ser Gln                 95                  100                 105 His Val Gly Val Ser Gly Xaa Xaa Xaa Leu Phe Ser Ala Ser Leu                 110                 115                 120 Ser Val Asp Xaa Thr Thr Thr Asp Gln Gln Leu Thr Glu Ile Thr                 125                 130                 135 Tyr Ser Ser Thr Arg Glu Ala His Val Leu Trp Tyr Ile Ser Leu                 140                 145                 150 Pro Gly Ala Ala Thr Leu Arg Ser Met Leu Arg Xaa Xaa Phe Xaa                 155                 160                 165 Xaa Asp Xaa Asn Asn Pro Asn Met Pro Ala Met Xaa Leu Phe Xaa                 170                 175                 180 Xaa Tyr Gly Pro Tyr Xaa Ile Ser Xaa Ala Ala Val Gly Gly Arg                 185                 190                 195 Leu Xaa Tyr Ser Ala Ala Ser Lys Thr Leu Lys Met Asp Ser Ser                 200                 205                 210 Xaa Ser Leu Ser Thr Thr Ala Xaa Met Ser Xaa Lys Ala Leu Val                 215                 220                 225 Gly Glu Ile Lys Ile Xaa His Gly Ser Xaa Met Glu Lys Gln Val                 230                 235                 240 Asn Ser Phe Arg Ser Asn Ser Thr Ile Arg Leu Thr Ala Thr Gly                 245                 250                 255 Gly Lys Pro Gly Met Thr Xaa Arg Ile Leu His Gly Pro Asp Ser                 260                 265                 270 Xaa Xaa Ala Phe Ser Xaa Trp Ala Glu Ser Leu Leu Asp Tyr Ala                 275                 280                 285 Thr Leu Met Asp Phe Ser Thr Xaa Ser Leu Xaa Pro Ile Trp Ala                 290                 295                 300 Leu Ala Asp Xaa Pro Glu Arg Xaa Val Glu Leu Glu Asp Ala Phe                 305                 310                 315 Pro Glu Phe Met Lys Gln Ser Gln Gln Ser Ile Pro Xaa Val Asp                 320                 325                 330 Lys Val Leu Leu Met Asp Ala Arg Pro Pro Met Val Xaa Ala Gly                 335                 340                 345 Glu Asp Xaa Xaa Ser Xaa Ala Xaa Xaa Asp Leu Ala Xaa Phe Asn                 350                 355                 360 Xaa Ser Thr Ser Asn Gly Tyr Lys Met Xaa Gly Gln Phe Xaa Gln                 365                 370                 375 Arg Asn His Ala Ser Val Ala Asp Gly His Ala Pro Ile Phe Lys                 380                 385                 390 Asp Leu Phe Asp Leu Gly Val Leu Lys Ala Pro Val Gly Trp Gln                 395                 400                 405 Xaa Val Trp Asp Asp Xaa Gly Ser Gly Lys Xaa Xaa Xaa Tyr Ala                 410                 415                 420 Cys Trp Arg Ala Ile Xaa Xaa Gln Gly Tyr Xaa Xaa Xaa Gly Asp                 425                 430                 435 Val Met Met Leu Ala Xaa Ser Gly Tyr Asn Pro Pro Asn Leu Pro                 440                 445                 450 Asp Tyr Val Cys Xaa His Gln Ser Leu Cys Ala Xaa Val Gln Thr                 455                 460                 465 Leu Xaa Asn Xaa Xaa Trp Trp Asp Xaa Gly Xaa Xaa Xaa Xaa Xaa                 470                 475                 480 Asp Val Ser Leu Trp Xaa Pro Gly Ala Ala Gly Ala Val Ala Ser                 485                 490                 495 Ser Cys Phe Ala Gly Val Pro Asn Tyr Asn Asn Pro Pro Asn Ser                 500                 505                 510 Gly Asp Ile Glu Xaa Leu Arg Gly Ser Ile Ala Cys Val Xaa Thr                 515                 520                 525 Ser Ala Ile Ala Ser Met Gln Glu Met Xaa Ser Met Leu Ser Gln                 530                 535 His Xaa Gly Met Glu Ala Met Met Ser Lys Leu, wherein Xaa at position 2 is Ala or Arg; Xaa at position 24 is Asp or Asn; Xaa at position 42 is Asp or Asn; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu or Asn; Xaa at position 74 is Lys, Glu or Gly; Xaa at position 79 is Lys or Glu; Xaa at position 82 is Glu, Ile, Leu or Tyr; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln or Gly; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile or Met; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp or Asn; Xaa at position 147 is Arg or Glu; Xaa at position 148 is Asp, Phe or Pro; Xaa at position 150 is Arg or Gln; Xaa at position 151 is Asp, Ser, Ala or Asn; Xaa at position 153 is Leu or Ile; Xaa at position 162 is Glu or Gln; Xaa at position 165 is Lys, Glu or Gln; Xaa at position 166 is Arg or Gln; Xaa at position 171 is Tyr or Phe; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 196 is Gln, Lys, Asn or Asp; Xaa at position 203 is Glu, Thr or His; Xaa at position 206 is Tyr or Phe; Xaa at position 216 is Glu or Gln; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser or Gln; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln or Lys; Xaa at position 257 is Gln or Ile; Xaa at position 261 is Gln or Glu; Xaa at position 278 is Glu or Asn; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr or Asp; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 313 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 333 is Ser, Gly, Lys, Val or Asn; Xaa at position 334 is Gly, Arg, Lys or Ile; Xaa at position 336 is Gly or Ala; Xaa at position 338 is Ser, His, Val, Lys or Ala; Xaa at position 339 is Glu, Asn, Ile or Pro; Xaa at position 343 is Val or Ile; Xaa at position 346 is Pro or Ala; Xaa at position 355 is Val or Ile; Xaa at position 359 is Gly or Ala; Xaa at position 391 is Arg, Glu or Gln; Xaa at position 396 is Ala, Leu, Lys, Asn or Gly; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val or Arg; Xaa at position 402 is Lys, Phe, His, Arg, Gly, Trp, Thr, Asn, Tyr or Met; Xaa at position 403 is Asp or Tyr; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg or Glu; Xaa at position 417 is Ala or Ser; Xaa at position 418 is Leu or Met; Xaa at position 426 is Thr or Ser; Xaa at position 440 is Val or Leu; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu or Ile; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp or Gln; Xaa at position 455 is Val or Ile; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile or Tyr; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala or Ser; Xaa at position 464 is Arg, Gly or His; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 500 is Arg or Gln; Xaa at position 509 is Lys or Gln; Xaa at position 520 is Lys, Gln or Glu; and Xaa at position 527 is Gln, Lys, Pro, Cys or Glu; wherein one or more amino acid(s) designated by Xaa in SEQ ID NO: 4 is an amino acid different from the corresponding amino acid of SEQ ID NO: 35 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35.

Embodiment 99 is the polynucleotide of embodiments 97 or 98, wherein the PHI-4 polypeptide further comprises one or more amino acid substitutions at position 86, 359, 464, 465, 466, 467, 468, 499 or 517 of SEQ ID NO: 3 or SEQ ID NO: 4.

Embodiment 100 is the polynucleotide of embodiment 99, wherein the amino acid at position 86 is Glu or Thr; the amino acid at position 359 is Gly or Ala; the amino acid at position 464 is Arg, Ala, Lys, Asp or Asn; the amino acid at position 465 is Lys or Met, the amino acid at position 467 is Val, Ala, Leu or Thr; the amino acid at position 468 is Ser or Leu; the amino acid at position 499 is Glu or Ala, or the amino acid at position 517 is Glu or Arg.

Embodiment 101 is the polynucleotide of any one of embodiments 97-100, wherein the PHI-4 polypeptide further comprises one or more more conservative amino acid substitution, insertion of one or more amino acids, deletion of one or more amino acids, and combinations thereof.

Embodiment 102 is the polynucleotide of any one of embodiments 97-101, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 1.5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 103 is the polynucleotide of any one of embodiments 97-101, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 2 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 104 is the polynucleotide of any one of embodiments 97-101, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 2.5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 105 is the polynucleotide of any one of embodiments 97-101, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 3 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 106 is the polynucleotide of any one of embodiments 97-101, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).

Embodiment 107 is the polynucleotide of any one of embodiments 97-106, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is against Western Corn Root Worm (WCRW) larvae.

Embodiment 108 is the polynucleotide of any one of embodiments 97-107, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as a Mean FAE Index.

Embodiment 109 is the polynucleotide of any one of embodiments 97-107, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as an EC50 value.

Embodiment 110 is the polynucleotide of any one of embodiments 97-107, wherein the improved activity compared to AXMI-205 (SEQ ID NO: 35) is quantified as a Mean Deviation Score.

Embodiment 111 is the polynucleotide of any one of embodiments 97-110, wherein the PHI-4 polypeptide has 1 to 54 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or 4.

Embodiment 112 is the polynucleotide of any one of embodiments 97-110, wherein the PHI-4 polypeptide has 1 to 27 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or 4.

Embodiment 113 is the polynucleotide of any one of embodiments 97-110, wherein the PHI-4 polypeptide has 1 to 20 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or 4.

Embodiment 114 is the polynucleotide of any one of embodiments 97-110, wherein the PHI-4 polypeptide has 1 to 15 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or 4.

Embodiment 115 is the polynucleotide of any one of embodiments 97-114, wherein 1-25 amino acids are deleted from the N-terminus of the PHI-4 polypeptide and/or C-terminus of the PHI-4 polypeptide.

Embodiment 116 is the polynucleotide of any one of embodiments 97-114, wherein 1-20 amino acids are deleted from the C-terminus of the PHI-4 polypeptide.

Embodiment 117 is a composition, comprising an insecticidally-effective amount of the PHI-4 polypeptide of any one of embodiments 1-58.

Embodiment 118 is a method of inhibiting growth or killing an insect pest, comprising contacting the insect pest with the composition of embodiment 117.

Embodiment 119 is a method for controlling an insect pest population resistant to a pesticidal protein, comprising contacting the resistant insect pest population with the composition of embodiment 117.

Embodiment 120 is the method of controlling an insect pest population resistant to an pesticidal protein of embodiment 119, wherein the pesticidal protein is selected from Cry1Ac, Cry1Ab, Cry1A.105, Cry1Ac, Cry1F, Cry1Fa2, Cry1F, Cry2Ab, Cry3A, mCry3A, Cry3Bb1, Cry34Ab1, Cry35Ab1, Vip3A, Cry9c, eCry3.1Ab and CBI-Bt.

Embodiment 121 is a transgenic plant or progeny thereof, comprising the polynucleotide of any one of embodiments 59-116. Embodiment 122 is the transgenic plant or progeny thereof of embodiment 121, wherein the transgenic plant is a monocotyledon.

Embodiment 123 is the transgenic plant or progeny thereof of embodiment 122, wherein the plant is selected from barley, corn, oat, rice, rye, sorghum, turf grass, sugarcane, wheat, alfalfa, banana, broccoli, bean, cabbage, canola, carrot, cassava, cauliflower, celery, citrus, cotton, a cucurbit, eucalyptus, flax, garlic, grape, onion, lettuce, pea, peanut, pepper, potato, poplar, pine, sunflower, safflower, soybean, strawberry, sugar beet, sweet potato, tobacco, tomato ornamental, shrub, nut, chickpea, pigeon pea, millets, hops and pasture grasses.

Embodiment 124 is the transgenic plant or progeny thereof of embodiment 123, further comprising one or more additional transgenic traits.

Embodiment 125 is the transgenic plant of embodiment 124, wherein the one or more additional transgenic trait is selected from insect resistance, herbicide resistance, fungal resistance, viral resistance, stress tolerance, disease resistance, male sterility, stalk strength, increased yield, modified starches, improved oil profile, balanced amino acids, high lysine or methionine, increased digestibility, improved fiber quality, flowering, ear and seed development, enhancement of nitrogen utilization efficiency, altered nitrogen responsiveness, drought resistance or tolerance, cold resistance or tolerance, salt resistance or tolerance and increased yield under stress.

Embodiment 126 is seed, grain or processed product thereof of the transgenic plant of any one of embodiments 121-126, wherein the seed, grain, or processed product thereof comprises the polynucleotide of any one of embodiments 121-125.

Embodiment 127 is an expression cassette, comprising the polynucleotide of any one of embodiments 59-116 operably linked to one or more regulatory sequences directing expression of the PHI-4 polypeptide.

Embodiment 128 is a transgenic plant or plant cell, comprising the expression cassette of embodiment 127.

Embodiment 129 is seed, grain or processed product thereof of the transgenic plant of embodiment 128, wherein the seed, grain, or processed product thereof comprises the recombinant nucleic acid molecule of embodiment of 1 and the recombinant nucleic acid molecule of 24.

Embodiment 130 is the seed of embodiment 129, wherein one or more seed treatment has been applied to the seed.

Embodiment 131 is a method for expressing in a plant a polynucleotide encoding an insecticidal protein, comprising

(a) inserting into a plant cell the polynucleotide of any one of embodiment 59-116 encoding the PHI-4 polypeptide;

(b) obtaining a transformed plant cell comprising the nucleic acid sequence of step (a); and

(c) generating from the transformed plant cell a plant capable of expressing the polynucleotide encoding the PHI-4 polypeptide.

Embodiment 132 is a method for protecting a plant from an insect pest, comprising expressing in the plant or cell thereof, an insecticidally-effective amount of the PHI-4 polypeptide of any one of embodiments 1-58.

Embodiment 133 is a method for controlling an insect pest population, comprising contacting the insect pest population with an insecticidally-effective amount of the PHI-4 polypeptide of any one of embodiments 1-58.

Embodiment 134 is a method of inhibiting growth or killing an insect pest, comprising contacting the insect pest with a composition comprising an insecticidally-effective amount of the the PHI-4 polypeptide of any one of embodiments 1-58.

Embodiment 135 is a method for controlling an insect pest population resistant to a pesticidal protein, comprising contacting the insect pest population with an insecticidally-effective amount of the PHI-4 polypeptide of any one of embodiments 1-58.

Embodiment 136 is a fusion protein comprising the PHI-4 polypeptide of any one of embodiments 1-58.

Embodiment 137 is a fusion protein comprising the PHI-4 polypeptide of any one of embodiments 1-58 and a maltose binding protein.

Embodiment 138 is the fusion protein of embodiment 137, wherein the maltose binding protein has an amino acid sequence of SEQ ID NO: 830 or SEQ ID NO: 831.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows FAE analysis of MPB::PHI-4-SFR12-004 (SEQ ID NO 31). The reference protein is MBP::PHI-4 polypeptide of (SEQ ID NO: 6). The % Response is given on the y axis. The dose of the toxin fragment is given on the x axis in parts per million (ppm). Half dose (+) and full dose (*) indicate a mean response from six replicate wells at the indicated concentration. All data are from a single experiment.

FIG. 2 shows EC50 analysis of MBP::PHI-4 polypeptide of SEQ ID NO 6 and MPB::PHI-4-SFR12-004 (SEQ ID NO 35). The % Response is given on they axis. The dose of the toxin fragment is given on the x axis in parts per million (ppm). The % Response is given on the y axis. The dose of the toxin fragment is given on the x axis in parts per million (ppm). The protein concentration (toxin portion of the protein only) is given on the x axis. Each symbol indicates a mean response from twenty-four replicate wells at the indicated concentration. All data are from a single experiment. triangles: MBP::PHI-4 (SEQ ID NO: 6); circles: MBP::PHI-4-SFR12-004 (SEQ ID NO: 31).

FIG. 3 shows the amino acids sequence of the C-terminal portion of the PHI-4 polypeptide (SEQ ID NO: 2) is given. Three stretches of sequence corresponding to nine amino acid motifs that align to the putative sugar binding loop motif D-X-G-(S/T)-G-X₃-D (SEQ ID NO:40) are highlighted in grey.

FIG. 4 shows EC50 data for SEQ ID NO: 610, SEQ ID NO: 595, SEQ ID NO: 584, SEQ ID NO: 591, SEQ ID NO: 576, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 79, SEQ ID NO: 81, SEQ ID NO: 150, SEQ ID NO: 150, SEQ ID NO: 149, SEQ ID NO: 167, SEQ ID NO: 167, SEQ ID NO: 164, SEQ ID NO: 164, SEQ ID NO: 170, SEQ ID NO: 170, SEQ ID NO: 795, SEQ ID NO: 794, SEQ ID NO: 784, SEQ ID NO: 799, SEQ ID NO: 785, SEQ ID NO: 788, SEQ ID NO: 786, SEQ ID NO: 796, SEQ ID NO: 787. The x-axis corresponds to the mean fold improvement in EC50, relative to MBP::PHI-4 (SEQ ID NO_: 6) from at least three independent EC50 measurements. The y-axis corresponds to the mean fold improvement of the mean FAE Index from at least three independent measurements.

FIG. 5. TO seedlings in the V3-V4 growth stage were challenged as described (Oleson J. et al J. Economic Entomology 98:1-8; 2005) and root nodal injury scores were recorded. The groups indicated along the x axis were either controls or were transformed with a plant vector of Example 18: 1, untransformed control; 2, positive control transgenic expressing positive control protein; 3, PHI-4 polypeptide (SEQ ID NO: 22); 4, PHI-4-B09 polypeptide (SEQ ID NO: 23); 5, PHI-4-D09 polypeptide (SEQ ID NO: 24); 6 PHI-4-H08 polypeptide (SEQ ID NO: 25).

DETAILED DESCRIPTION

It is to be understood that this invention is not limited to the particular methodology, protocols, cell lines, genera, and reagents described, as such may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention.

As used herein the singular forms “a”, “and”, and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a cell” includes a plurality of such cells and reference to “the protein” includes reference to one or more proteins and equivalents thereof known to those skilled in the art, and so forth. All technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this invention belongs unless clearly indicated otherwise.

The present disclosure is drawn to compositions and methods for controlling pests. The methods involve transforming organisms with a nucleic acid sequence encoding a PHI-4 polypeptide. In particular, the nucleic acid sequences of the embodiments are useful for preparing plants and microorganisms possessing pesticidal activity. Thus, transformed bacteria, plants, plant cells, plant tissues and seeds are provided. Compositions are pesticidal nucleic acids and proteins of bacterial species. The nucleic acid sequences find use in the construction of expression vectors for subsequent transformation into organisms of interest, as probes for the isolation of other homologous (or partially homologous) genes, and for the generation of altered PHI-4 polypeptides by methods known in the art, such as site directed mutagenesis, domain swapping or DNA shuffling. The PHI-4 polypeptides find use in controlling, inhibiting growth or killing Lepidopteran, Coleopteran, Dipteran, fungal, Hemipteran, and nematode pest populations and for producing compositions with pesticidal activity. Insect pests of interest include, but are not limited to, the superfamily of stink bugs and other related insects including, but not limited to, species belonging to the family Pentatomidae (Nezara viridula, Halyomorpha halys, Piezodorus guildini, Euschistus servus, Acrosternum hilare, Euschistus heros, Euschistus tristigmus, Acrosternum hilare, Dichelops furcatus, Dichelops melacanthus, and Bagrada hilaris (Bagrada Bug)), the family Plataspidae (Megacopta cribraria—Bean plataspid), and the family Cydnidae (Scaptocoris castanea—Root stink bug) and Lepidoptera species including but not limited to: diamond-back moth, e.g., Helicoverpa zea Boddie; soybean looper, e.g., Pseudoplusia includens Walker and velvet bean caterpillar e.g., Anticarsia gemmatalis Hübner.

By “pesticidal toxin” or “pesticidal protein” is intended a toxin that has toxic activity against one or more pests, including, but not limited to, members of the Lepidoptera, Diptera, Hemiptera and Coleoptera orders or the Nematoda phylum or a protein that has homology to such a protein. Pesticidal proteins have been isolated from organisms including, for example, Bacillus sp., Pseudomonas sp., Photorhabdus sp., Xenorhabdus sp., Clostridium bifermentans and Paenibacillus popilliae. Pesticidal proteins include but are not limited to: insecticidal proteins from Pseudomonas sp. such as PSEEN3174 (Monalysin, (2011) PLoS Pathogens, 7:1-13) from Pseudomonas protegens strain CHAO and Pf-5 (previously fluorescens) (Pechy-Tarr, (2008) Environmental Microbiology 10:2368-2386: GenBank Accession No. EU400157); from Pseudomonas Taiwanensis (Liu, et al., (2010) J. Agric. Food Chem. 58:12343-12349) and from Pseudomonas pseudoalcligenes (Zhang, et al, (2009) Annals of Microbiology 59:45-50 and Li, et al., (2007) Plant Cell Tiss. Organ Cult. 89:159-168); insecticidal proteins from Photorhabdus sp. and Xenorhabdus sp. (Hinchliffe, et al., (2010) The Open Toxinology Journal 3:101-118 and Morgan, et al., (2001) Applied and Envir. Micro. 67:2062-2069), U.S. Pat. Nos. 6,048,838, and 6,379,946; and δ-endotoxins including, but not limited to, the cry1, cry2, cry3, cry4, cry5, cry6, cry7, cry8, cry9, cry10, cry11, cry12, cry13, cry14, cry15, cry16, cry17, cry18, cry19, cry20, cry21, cry22, cry23, cry24, cry25, cry26, cry27, cry 28, cry 29, cry 30, cry31, cry32, cry33, cry34, cry35,cry36, cry37, cry38, cry39, cry40, cry41, cry42, cry43, cry44, cry45, cry 46, cry47, cry49, cry 51 and cry55 classes of δ-endotoxin genes and the B. thuringiensis cytolytic cyt1 and cyt2 genes. Members of these classes of B. thuringiensis insecticidal proteins include, but are not limited to Cry1Aa1 (Accession #M11250), Cry1Aa2 (Accession #M10917), Cry1Aa3 (Accession #D00348), Cry1Aa4 (Accession #X13535), Cry1Aa5 (Accession #D17518), Cry1Aa6 (Accession #U43605), Cry1Aa7 (Accession #AF081790), Cry1Aa8 (Accession #126149), Cry1Aa9 (Accession #AB026261), Cry1Aa10 (Accession #AF154676), Cry1Aa11 (Accession #Y09663), Cry1Aa12 (Accession #AF384211), Cry1Aa13 (Accession #AF510713), Cry1Aa14 (Accession #AY197341), Cry1Aa15 (Accession #D0062690), Cry1Ab1 (Accession #M13898), Cry1Ab2 (Accession #M12661), Cry1Ab3 (Accession #M15271), Cry1Ab4 (Accession #D00117), Cry1Ab5 (Accession #X04698), Cry1Ab6 (Accession #M37263), Cry1Ab7 (Accession #X13233), Cry1Ab8 (Accession #M16463), Cry1Ab9 (Accession #X54939), Cry1Ab10 (Accession #A29125), Cry1Ab11 (Accession #112419), Cry1Ab12 (Accession #AF059670), Cry1Ab13 (Accession #AF254640), Cry1Ab14 (Accession #U94191), Cry1Ab15 (Accession #AF358861), Cry1Ab16 (Accession #AF375608), Cry1Ab17 (Accession #AAT46415), Cry1Ab18 (Accession #AAQ88259), Cry1Ab19 (Accession #AY847289), Cry1Ab20 (Accession #D0241675), Cry1Ab21 (Accession #EF683163), Cry1Ab22 (Accession #ABW87320), Cry1Ab-like (Accession #AF327924), Cry1Ab-like (Accession #AF327925), Cry1Ab-like (Accession #AF327926), Cry1Ab-like (Accession #D0781309), Cry1Ac1 (Accession #M11068), Cry1Ac2 (Accession #M35524), Cry1Ac3 (Accession #X54159), Cry1Ac4 (Accession #M73249), Cry1Ac5 (Accession #M73248), Cry1Ac6 (Accession #U43606), Cry1Ac7 (Accession #U87793), Cry1Ac8 (Accession #U87397), Cry1Ac9 (Accession #U89872), Cry1Ac10 (Accession #AJ002514), Cry1Ac11 (Accession #AJ130970), Cry1Ac12 (Accession #112418), Cry1Ac13 (Accession #AF148644), Cry1Ac14 (Accession #AF492767), Cry1Ac15 (Accession #AY122057), Cry1Ac16 (Accession #AY730621), Cry1Ac17 (Accession #AY925090), Cry1Ac18 (Accession #D0023296), Cry1Ac19 (Accession #D0195217), Cry1Ac20 (Accession #D0285666), Cry1Ac21 (Accession #D0062689), Cry1Ac22 (Accession #EU282379), Cry1Ac23 (Accession #AM949588), Cry1Ac24 (Accession #ABL01535), Cry1Ad1 (Accession #M73250), Cry1Ad2 (Accession #A27531), Cry1Ae1 (Accession #M65252), Cry1Af1 (Accession #U82003), Cry1Ag1 (Accession #AF081248), Cry1Ah1 (Accession #AF281866), Cry1Ah2 (Accession #D0269474), Cry1Ai1 (Accession #AY174873), Cry1A-like (Accession #AF327927), Cry1Ba1 (Accession #X06711), Cry1Ba2 (Accession #X95704), Cry1Ba3 (Accession #AF368257), Cry1 Ba4 (Accession #AF363025), Cry1 Ba5 (Accession #AB020894), Cry1 Ba6 (Accession #ABL60921), Cry1 Bb1 (Accession #L32020), Cry1Bc1 (Accession #Z46442), Cry1Bd1 (Accession #U70726), Cry1Bd2 (Accession #AY138457), Cry1Be1 (Accession #AF077326), Cry1Be2 (Accession #AAQ52387), Cry1Bf1 (Accession #AX189649), Cry1Bf2 (Accession #AAQ52380), Cry1Bg1 (Accession #AY176063), Cry1Ca1 (Accession #X07518), Cry1Ca2 (Accession #X13620), Cry1Ca3 (Accession #M73251), Cry1Ca4 (Accession #A27642), Cry1Ca5 (Accession #X96682), Cry1Ca6 [1] (Accession #AF215647), Cry1Ca7 (Accession #AY015492), Cry1Ca8 (Accession #AF362020), Cry1Ca9 (Accession #AY078160), Cry1Ca10 (Accession #AF540014), Cry1Ca11 (Accession #AY955268), Cry1Cb1 (Accession #M97880), Cry1Cb2 (Accession #AY007686), Cry1Cb3 (Accession #EU679502), Cry1Cb-like (Accession #AAX63901), Cry1Da1 (Accession #X54160), Cry1Da2 (Accession #176415), Cry1Db1 (Accession #Z22511), Cry1Db2 (Accession #AF358862), Cry1Dc1 (Accession #EF059913), Cry1Ea1 (Accession #X53985), Cry1Ea2 (Accession #X56144), Cry1 Ea3 (Accession #M73252), Cry1 Ea4 (Accession #U94323), Cry1 Ea5 (Accession #A15535), Cry1 Ea6 (Accession #AF202531), Cry1 Ea7 (Accession #AAW72936), Cry1 Ea8 (Accession #ABX11258), Cry1Eb1 (Accession #M73253), Cry1Fa1 (Accession #M63897), Cry1Fa2 (Accession #M73254), Cry1Fb1 (Accession #Z22512), Cry1Fb2 (Accession #AB012288), Cry1Fb3 (Accession #AF062350), Cry1Fb4 (Accession #173895), Cry1Fb5 (Accession #AF336114), Cry1Fb6 (Accession #EU679500), Cry1Fb7 (Accession #EU679501), Cry1Ga1 (Accession #Z22510), Cry1Ga2 (Accession #Y09326), Cry1Gb1 (Accession #U70725), Cry1Gb2 (Accession #AF288683), Cry1Gc (Accession #AAQ52381), Cry1Ha1 (Accession #Z22513), Cry1Hb1 (Accession #U35780), Cry1H-like (Accession #AF182196), Cry1Ia1 (Accession #X62821), Cry1Ia2 (Accession #M98544), Cry1Ia3 (Accession #L36338), Cry1Ia4 (Accession #L49391), Cry1Ia5 (Accession #Y08920), Cry1Ia6 (Accession #AF076953), Cry1Ia7 (Accession #AF278797), Cry1Ia8 (Accession #AF373207), Cry1Ia9 (Accession #AF521013), Cry1Ia10 (Accession #AY262167), Cry1Ia11 (Accession #AJ315121), Cry1Ia12 (Accession #AAV53390), Cry1Ia13 (Accession #ABF83202), Cry1Ia14 (Accession #EU887515), Cry1Ib1 (Accession #U07642), Cry1Ib2 (Accession #ABW88019), Cry1Ib3 (Accession #EU677422), Cry1Ic1 (Accession #AF056933), Cry1Ic2 (Accession #AAE71691), Cry1Id1 (Accession #AF047579), Cry1Ie1 (Accession #AF211190), Cry1If1 (Accession #AAQ52382), Cry1I-like (Accession #190732), Cry1I-like (Accession #DQ781310), Cry1Ja1 (Accession #L32019), Cry1Jb1 (Accession #U31527), Cry1Jc1 (Accession #190730), Cry1Jc2 (Accession #AAQ52372), Cry1Jd1 (Accession #AX189651), Cry1Ka1 (Accession #U28801), Cry1La1 (Accession #AAS60191), Cry1-like (Accession #190729), Cry2Aa1 (Accession #M31738), Cry2Aa2 (Accession #M23723), Cry2Aa3 (Accession #D86064), Cry2Aa4 (Accession #AF047038), Cry2Aa5 (Accession #AJ132464), Cry2Aa6 (Accession #AJ132465), Cry2Aa7 (Accession #AJ132463), Cry2Aa8 (Accession #AF252262), Cry2Aa9 (Accession #AF273218), Cry2Aa10 (Accession #AF433645), Cry2Aa11 (Accession #AAQ52384), Cry2Aa12 (Accession #DQ977646), Cry2Aa13 (Accession #ABL01536), Cry2Aa14 (Accession #ACF04939), Cry2Ab1 (Accession #M23724), Cry2Ab2 (Accession #X55416), Cry2Ab3 (Accession #AF164666), Cry2Ab4 (Accession #AF336115), Cry2Ab5 (Accession #AF441855), Cry2Ab6 (Accession #AY297091), Cry2Ab7 (Accession #DQ119823), Cry2Ab8 (Accession #DQ361266), Cry2Ab9 (Accession #DQ341378), Cry2Ab10 (Accession #EF157306), Cry2Ab11 (Accession #AM691748), Cry2Ab12 (Accession #ABM21764), Cry2Ab13 (Accession #EU909454), Cry2Ab14 (Accession #EU909455), Cry2Ac1 (Accession #X57252), Cry2Ac2 (Accession #AY007687), Cry2Ac3 (Accession #AAQ52385), Cry2Ac4 (Accession #DQ361267), Cry2Ac5 (Accession #DQ341379), Cry2Ac6 (Accession #DQ359137), Cry2Ac7 (Accession #AM292031), Cry2Ac8 (Accession #AM421903), Cry2Ac9 (Accession #AM421904), Cry2Ac10 (Accession #BI 877475), Cry2Ac11 (Accession #AM689531), Cry2Ac12 (Accession #AM689532), Cry2Ad1 (Accession #AF200816), Cry2Ad2 (Accession #DQ358053), Cry2Ad3 (Accession #AM268418), Cry2Ad4 (Accession #AM490199), Cry2Ad5 (Accession #AM765844), Cry2Ae1 (Accession #AAQ52362), Cry2Af1 (Accession #EF439818), Cry2Ag (Accession #ACH91610), Cry2Ah (Accession #EU939453), Cry3Aa1 (Accession #M22472), Cry3Aa2 (Accession #J02978), Cry3Aa3 (Accession #Y00420), Cry3Aa4 (Accession #M30503), Cry3Aa5 (Accession #M37207), Cry3Aa6 (Accession #U10985), Cry3Aa7 (Accession #AJ237900), Cry3Aa8 (Accession #AAS79487), Cry3Aa9 (Accession #AAW05659), Cry3Aa10 (Accession #AAU29411), Cry3Aa11 (Accession #AY882576), Cry3Aa12 (Accession #ABY49136), Cry3Ba1 (Accession #X17123), Cry3Ba2 (Accession #A07234), Cry3Bb1 (Accession #M89794), Cry3Bb2 (Accession #U31633), Cry3Bb3 (Accession #115475), Cry3Ca1 (Accession #X59797), Cry4Aa1 (Accession #Y00423), Cry4Aa2 (Accession #D00248), Cry4Aa3 (Accession #AL731825), Cry4A-like (Accession #DQ078744), Cry4Ba1 (Accession #X07423), Cry4Ba2 (Accession #X07082), Cry4Ba3 (Accession #M20242), Cry4Ba4 (Accession #D00247), Cry4Ba5 (Accession #AL731825), Cry4Ba-like (Accession #ABC47686), Cry4Ca1 (Accession #EU646202), Cry5Aa1 (Accession #L07025), Cry5Ab1 (Accession #L07026), Cry5Ac1 (Accession #134543), Cry5Ad1 (Accession #EF219060), Cry5Ba1 (Accession #U19725), Cry5Ba2 (Accession #EU121522), Cry6Aa1 (Accession #L07022), Cry6Aa2 (Accession #AF499736), Cry6Aa3 (Accession #DQ835612), Cry6Ba1 (Accession #L07024), Cry7Aa1 (Accession #M64478), Cry7Ab1 (Accession #U04367), Cry7Ab2 (Accession #U04368), Cry7Ab3 (Accession #BI 1015188), Cry7Ab4 (Accession #EU380678), Cry7Ab5 (Accession #ABX79555), Cry7Ab6 (Accession #FJ194973), Cry7Ba1 (Accession #ABB70817), Cry7Ca1 (Accession #EF486523), Cry8Aa1 (Accession #U04364), Cry8Ab1 (Accession #EU044830), Cry8Ba1 (Accession #U04365), Cry8Bb1 (Accession #AX543924), Cry8Bc1 (Accession #AX543926), Cry8Ca1 (Accession #U04366), Cry8Ca2 (Accession #AAR98783), Cry8Ca3 (Accession #EU625349), Cry8Da1 (Accession #AB089299), Cry8Da2 (Accession #BD133574), Cry8Da3 (Accession #BD133575), Cry8Db1 (Accession #AB303980), Cry8Ea1 (Accession #AY329081), Cry8Ea2 (Accession #EU047597), Cry8Fa1 (Accession #AY551093), Cry8Ga1 (Accession #AY590188), Cry8Ga2 (Accession #DQ318860), Cry8Ga3 (Accession #FJ198072), Cry8Ha1 (Accession #EF465532), Cry8Ia1 (Accession #EU381044), Cry8Ja1 (Accession #EU625348), Cry8 like (Accession #ABS53003), Cry9Aa1 (Accession #X58120), Cry9Aa2 (Accession #X58534), Cry9Aa like (Accession #AAQ52376), Cry9Ba1 (Accession #X75019), Cry9Bb1 (Accession #AY758316), Cry9Ca1 (Accession #Z37527), Cry9Ca2 (Accession #AAQ52375), Cry9Da1 (Accession #D85560), Cry9Da2 (Accession #AF042733), Cry9Db1 (Accession #AY971349), Cry9Ea1 (Accession #AB011496), Cry9Ea2 (Accession #AF358863), Cry9Ea3 (Accession #EF157307), Cry9Ea4 (Accession #EU760456), Cry9Ea5 (Accession #EU789519), Cry9Ea6 (Accession #EU887516), Cry9Eb1 (Accession #AX189653), Cry9Ec1 (Accession #AF093107), Cry9Ed1 (Accession #AY973867), Cry9 like (Accession #AF093107), Cry10Aa1 (Accession #M12662), Cry10Aa2 (Accession #E00614), Cry10Aa3 (Accession #AL731825), Cry10A like (Accession #DQ167578), Cry11Aa1 (Accession #M31737), Cry11Aa2 (Accession #M22860), Cry11Aa3 (Accession #AL731825), Cry11Aa-like (Accession #DQ166531), Cry11Ba1 (Accession #X86902), Cry11Bb1 (Accession #AF017416), Cry12Aa1 (Accession #L07027), Cry13Aa1 (Accession #L07023), Cry14Aa1 (Accession #U13955), Cry15Aa1 (Accession #M76442), Cry16Aa1 (Accession #X94146), Cry17Aa1 (Accession #X99478), Cry18Aa1 (Accession #X99049), Cry18Ba1 (Accession #AF169250), Cry18Ca1 (Accession #AF169251), Cry19Aa1 (Accession #Y07603), Cry19Ba1 (Accession #D88381), Cry20Aa1 (Accession #U82518), Cry21Aa1 (Accession #132932), Cry21Aa2 (Accession #166477), Cry21Ba1 (Accession #AB088406), Cry22Aa1 (Accession #134547), Cry22Aa2 (Accession #AX472772), Cry22Aa3 (Accession #EU715020), Cry22Ab1 (Accession #AAK50456), Cry22Ab2 (Accession #AX472764), Cry22Ba1 (Accession #AX472770), Cry23Aa1 (Accession #AAF76375), Cry24Aa1 (Accession #U88188), Cry24Ba1 (Accession #BAD32657), Cry24Ca1 (Accession #AM158318), Cry25Aa1 (Accession #U88189), Cry26Aa1 (Accession #AF122897), Cry27Aa1 (Accession #AB023293), Cry28Aa1 (Accession #AF132928), Cry28Aa2 (Accession #AF285775), Cry29Aa1 (Accession #AJ251977), Cry30Aa1 (Accession #AJ251978), Cry30Ba1 (Accession #BAD00052), Cry30Ca1 (Accession #BAD67157), Cry30Da1 (Accession #EF095955), Cry30Db1 (Accession #BAE80088), Cry30Ea1 (Accession #EU503140), Cry30Fa1 (Accession #EU751609), Cry30Ga1 (Accession #EU882064), Cry31Aa1 (Accession #AB031065), Cry31Aa2 (Accession #AY081052), Cry31Aa3 (Accession #AB250922), Cry31Aa4 (Accession #AB274826), Cry31Aa5 (Accession #AB274827), Cry31Ab1 (Accession #AB250923), Cry31Ab2 (Accession #AB274825), Cry31Ac1 (Accession #AB276125), Cry32Aa1 (Accession #AY008143), Cry32Ba1 (Accession #BAB78601), Cry32Ca1 (Accession #BAB78602), Cry32Da1 (Accession #BAB78603), Cry33Aa1 (Accession #AAL26871), Cry34Aa1 (Accession #AAG50341), Cry34Aa2 (Accession #AAK64560), Cry34Aa3 (Accession #AY536899), Cry34Aa4 (Accession #AY536897), Cry34Ab1 (Accession #AAG41671), Cry34Ac1 (Accession #AAG50118), Cry34Ac2 (Accession #AAK64562), Cry34Ac3 (Accession #AY536896), Cry34Ba1 (Accession #AAK64565), Cry34Ba2 (Accession #AY536900), Cry34Ba3 (Accession #AY536898), Cry35Aa1 (Accession #AAG50342), Cry35Aa2 (Accession #AAK64561), Cry35Aa3 (Accession #AY536895), Cry35Aa4 (Accession #AY536892), Cry35Ab1 (Accession #AAG41672), Cry35Ab2 (Accession #AAK64563), Cry35Ab3 (Accession #AY536891), Cry35Ac1 (Accession #AAG50117), Cry35Ba1 (Accession #AAK64566), Cry35Ba2 (Accession #AY536894), Cry35Ba3 (Accession #AY536893), Cry36Aa1 (Accession #AAK64558), Cry37Aa1 (Accession #AAF76376), Cry38Aa1 (Accession #AAK64559), Cry39Aa1 (Accession #BAB72016), Cry40Aa1 (Accession #BAB72018), Cry40Ba1 (Accession #BAC77648), Cry40Ca1 (Accession #EU381045), Cry40Da1 (Accession #EU596478), Cry41Aa1 (Accession #AB116649), Cry41Ab1 (Accession #AB116651), Cry42Aa1 (Accession #AB116652), Cry43Aa1 (Accession #AB115422), Cry43Aa2 (Accession #AB176668), Cry43Ba1 (Accession #AB115422), Cry43-like (Accession #AB115422), Cry44Aa (Accession #BAD08532), Cry45Aa (Accession #BAD22577), Cry46Aa (Accession #BAC79010), Cry46Aa2 (Accession #BAG68906), Cry46Ab (Accession #BAD35170), Cry47Aa (Accession #AY950229), Cry48Aa (Accession #AJ841948), Cry48Aa2 (Accession #AM237205), Cry48Aa3 (Accession #AM237206), Cry48Ab (Accession #AM237207), Cry48Ab2 (Accession #AM237208), Cry49Aa (Accession #AJ841948), Cry49Aa2 (Accession #AM237201), Cry49Aa3 (Accession #AM237203), Cry49Aa4 (Accession #AM237204), Cry49Ab1 (Accession #AM237202), Cry50Aa1 (Accession #AB253419), Cry51Aa1 (Accession #DQ836184), Cry52Aa1 (Accession #EF613489), Cry53Aa1 (Accession #EF633476), Cry54Aa1 (Accession #EU339367), Cry55Aa1 (Accession #EU121521), Cry55Aa2 (Accession #AAE33526), Cyt1Aa (GenBank Accession #X03182), Cyt1Ab (GenBank Accession #X98793), Cyt1B (GenBank Accession #U37196), Cyt2A (GenBank Accession #Z14147), Cyt2B (GenBank Accession #U52043).

Examples of δ-endotoxins also include but are not limited to Cry1A proteins of U.S. Pat. Nos. 5,880,275 and 7,858,849; a DIG-3 or DIG-11 toxin (N-terminal deletion of α-helix 1 and/or α-helix 2 variants of cry proteins such as Cry1A, Cry3A) of U.S. Pat. Nos. 8,304,604 and 8,304,605, Cry1B of U.S. patent application Ser. No. 10/525,318; Cry1C of U.S. Pat. No. 6,033,874; Cry1F of U.S. Pat. Nos. 5,188,960, 6,218,188; Cry1A/F chimeras of U.S. Pat. Nos. 7,070,982; 6,962,705 and 6,713,063); a Cry2 protein such as Cry2Ab protein of U.S. Pat. No. 7,064,249); a Cry3A protein including but not limited to an engineered hybrid insecticidal protein (eHIP) created by fusing unique combinations of variable regions and conserved blocks of at least two different Cry proteins (US Patent Application Publication Number 2010/0017914); a Cry4 protein; a Cry5 protein; a Cry6 protein; Cry8 proteins of U.S. Pat. Nos. 7,329,736, 7,449,552, 7,803,943, 7,476,781, 7,105,332, 7,378,499 and 7,462,760; a Cry9 protein such as such as members of the Cry9A, Cry9B, Cry9C, Cry9D, Cry9E, and Cry9F families; a Cry15 protein of Naimov, et al., (2008) Applied and Environmental Microbiology 74:7145-7151; a Cry22, a Cry34Ab1 protein of U.S. Pat. Nos. 6,127,180, 6,624,145 and 6,340,593; a CryET33 and cryET34 protein of U.S. Pat. Nos. 6,248,535, 6,326,351, 6,399,330, 6,949,626, 7,385,107 and 7,504,229; a CryET33 and CryET34 homologs of US Patent Publication Number 2006/0191034, 2012/0278954, and PCT Publication Number WO 2012/139004; a Cry35Ab1 protein of U.S. Pat. Nos. 6,083,499, 6,548,291 and 6,340,593; a Cry46 protein, a Cry 51 protein, a Cry binary toxin; a TIC901 or related toxin; TIC807 of US 2008/0295207; ET29, ET37, TIC809, TIC810, TIC812, TIC127, TIC128 of PCT US 2006/033867; AXMI-027, AXMI-036, and AXMI-038 of U.S. Pat. No. 8,236,757; AXMI-031, AXMI-039, AXMI-040, AXMI-049 of U.S. Pat. No. 7,923,602; AXMI-018, AXMI-020, and AXMI-021 of WO 2006/083891; AXMI-010 of WO 2005/038032; AXMI-003 of WO 2005/021585; AXMI-008 of US 2004/0250311; AXMI-006 of US 2004/0216186; AXMI-007 of US 2004/0210965; AXMI-009 of US 2004/0210964; AXMI-014 of US 2004/0197917; AXMI-004 of US 2004/0197916; AXMI-028 and AXMI-029 of WO 2006/119457; AXMI-007, AXMI-008, AXMI-0080rf2, AXMI-009, AXMI-014 and AXMI-004 of WO 2004/074462; AXMI-150 of U.S. Pat. No. 8,084,416; AXMI-205 of US20110023184; AXMI-011, AXMI-012, AXMI-013, AXMI-015, AXMI-019, AXMI-044, AXMI-037, AXMI-043, AXMI-033, AXMI-034, AXMI-022, AXMI-023, AXMI-041, AXMI-063, and AXMI-064 of US 2011/0263488; AXMI-R1 and related proteins of US 2010/0197592; AXMI221Z, AXMI222z, AXMI223z, AXMI224z and AXMI225z of WO 2011/103248; AXMI218, AXMI219, AXMI220, AXMI226, AXMI227, AXMI228, AXMI229, AXMI230, and AXMI231 of WO11/103247; AXMI-115, AXMI-113, AXMI-005, AXMI-163 and AXMI-184 of U.S. Pat. No. 8,334,431; AXMI-001, AXMI-002, AXMI-030, AXMI-035, and AXMI-045 of US 2010/0298211; AXMI-066 and AXMI-076 of US20090144852; AXMI128, AXMI130, AXMI131, AXMI133, AXMI140, AXMI141, AXMI142, AXMI143, AXMI144, AXMI146, AXMI148, AXMI149, AXMI152, AXMI153, AXMI154, AXMI155, AXMI156, AXMI157, AXMI158, AXMI162, AXMI165, AXMI166, AXMI167, AXMI168, AXMI169, AXMI170, AXMI171, AXMI172, AXMI173, AXMI174, AXMI175, AXMI176, AXMI177, AXMI178, AXMI179, AXMI180, AXMI181, AXMI182, AXMI185, AXMI186, AXMI187, AXMI188, AXMI189 of U.S. Pat. No. 8,318,900; AXMI079, AXMI080, AXMI081, AXMI082, AXMI091, AXMI092, AXMI096, AXMI097, AXMI098, AXMI099, AXMI100, AXMI101, AXMI102, AXMI103, AXMI104, AXMI107, AXMI108, AXMI109, AXMI110, AXMI111, AXMI112, AXMI114, AXMI116, AXMI117, AXMI118, AXMI119, AXMI120, AXMI121, AXMI122, AXMI123, AXMI124, AXMI1257, AXMI1268, AXMI127, AXMI129, AXMI164, AXMI151, AXMI161, AXMI183, AXMI132, AXMI138, AXMI137 of US 2010/0005543; cry proteins such as Cry1A and Cry3A having modified proteolytic sites of U.S. Pat. No. 8,319,019; a Cry1Ac, Cry2Aa and Cry1Ca toxin protein from Bacillus thuringiensis strain VBTS 2528 of US Patent Application Publication Number 2011/0064710. Other Cry proteins are well known to one skilled in the art (see, Crickmore, et al., “Bacillus thuringiensis toxin nomenclature” (2011), at lifesci.sussex.ac.uk/home/Neil_Crickmore/Bt/ which can be accessed on the world-wide web using the “www” prefix). The insecticidal activity of Cry proteins is well known to one skilled in the art (for review, see, van Frannkenhuyzen, (2009) J. Invert. Path. 101:1-16). The use of Cry proteins as transgenic plant traits is well known to one skilled in the art and Cry-transgenic plants including but not limited to Cry1Ac, Cry1Ac+Cry2Ab, Cry1Ab, Cry1A.105, Cry1F, Cry1Fa2, Cry1F+Cry1Ac, Cry2Ab, Cry3A, mCry3A, Cry3Bb1, Cry34Ab1, Cry35Ab1, Vip3A, mCry3A, Cry9c and CBI-Bt have received regulatory approval (see, Sanahuja, (2011) Plant Biotech Journal 9:283-300 and the CERA (2010) GM Crop Database Center for Environmental Risk Assessment (CERA), ILSI Research Foundation, Washington D.C. at cera-gmc.org/index.php?action=gm_crop_database which can be accessed on the world-wide web using the “www” prefix). More than one pesticidal proteins well known to one skilled in the art can also be expressed in plants such as Vip3Ab & Cry1Fa (US2012/0317682), Cry1BE & Cry1F (US2012/0311746), Cry1CA & Cry1AB (US2012/0311745), Cry1F & CryCa (US2012/0317681), Cry1DA & Cry1BE (US2012/0331590), Cry1DA & Cry1Fa (US2012/0331589), Cry1AB & Cry1BE (US2012/0324606), and Cry1Fa & Cry2Aa, Cry1I or Cry1E (US2012/0324605). Pesticidal proteins also include insecticidal lipases including lipid acyl hydrolases of U.S. Pat. No. 7,491,869, and cholesterol oxidases such as from Streptomyces (Purcell et al. (1993) Biochem Biophys Res Commun 15:1406-1413). Pesticidal proteins also include VIP (vegetative insecticidal proteins) toxins of U.S. Pat. Nos. 5,877,012, 6,107,279 6,137,033, 7,244,820, 7,615,686, and 8,237,020 and the like. Other VIP proteins are well known to one skilled in the art (see, lifesci.sussex.ac.uk/home/Neil_Crickmore/Bt/vip.html which can be accessed on the world-wide web using the “www” prefix). Pesticidal proteins also include toxin complex (TC) proteins, obtainable from organisms such as Xenorhabdus, Photorhabdus and Paenibacillus (see, U.S. Pat. Nos. 7,491,698 and 8,084,418). Some TC proteins have “stand alone” insecticidal activity and other TC proteins enhance the activity of the stand-alone toxins produced by the same given organism. The toxicity of a “stand-alone” TC protein (from Photorhabdus, Xenorhabdus or Paenibacillus, for example) can be enhanced by one or more TC protein “potentiators” derived from a source organism of a different genus. There are three main types of TC proteins. As referred to herein, Class A proteins (“Protein A”) are stand-alone toxins. Class B proteins (“Protein B”) and Class C proteins (“Protein C”) enhance the toxicity of Class A proteins. Examples of Class A proteins are TcbA, TcdA, XptA1 and XptA2. Examples of Class B proteins are TcaC, TcdB, XptB1Xb and XptC1Wi. Examples of Class C proteins are TccC, XptC1Xb and XptB1Wi. Pesticidal proteins also include spider, snake and scorpion venom proteins. Examples of spider venom peptides include but not limited to lycotoxin-1 peptides and mutants thereof (U.S. Pat. No. 8,334,366).

In some embodiments the PHI-4 polypeptides include amino acid sequences deduced from the full-length nucleic acid sequences disclosed herein, and amino acid sequences that are shorter than the full-length sequences, either due to the use of an alternate downstream start site or due to processing that produces a shorter protein having pesticidal activity. Processing may occur in the organism the protein is expressed in, or in the pest after ingestion of the protein.

Thus, provided herein are novel isolated or recombinant nucleic acid sequences that confer pesticidal activity. Also provided are the amino acid sequences of PHI-4 polypeptides. The protein resulting from translation of these PHI-4 polypeptide genes allows cells to control or kill pests that ingest it.

Nucleic Acid Molecules, and Variants and Fragments Thereof

One aspect pertains to isolated or recombinant nucleic acid molecules comprising nucleic acid sequences encoding PHI-4 polypeptides and polypeptides or biologically active portions thereof, as well as nucleic acid molecules sufficient for use as hybridization probes to identify nucleic acid molecules encoding proteins with regions of sequence homology. As used herein, the term “nucleic acid molecule” is intended to include DNA molecules (e.g., recombinant DNA, cDNA, genomic DNA, plastid DNA, mitochondrial DNA) and RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can be single-stranded or double-stranded, but preferably is double-stranded DNA.

An “isolated” or “recombinant” nucleic acid molecule (or DNA) is used herein to refer to a nucleic acid sequence (or DNA) that is no longer in its natural environment, for example in an in vitro or in a recombinant bacterial or plant host cell. In some embodiments, an “isolated” or “recombinant” nucleic acid is free of sequences (preferably protein encoding sequences) that naturally flank the nucleic acid (i.e., sequences located at the 5′ and 3′ ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For purposes of the disclosure, “isolated” or “recombinant” when used to refer to nucleic acid molecules excludes isolated chromosomes. For example, in various embodiments, the recombinant nucleic acid molecule encoding a PHI-4 polypeptide can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleic acid sequences that naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived.

A variety of polynucleotides encoding a PHI-4 polypeptide(s) or related proteins are contemplated. Such polynucleotides are useful for production of PHI-4 polypeptides in host cells when operably linked to suitable promoter, transcription termination and/or polyadenylation sequences. Such polynucleotides are also useful as probes for isolating homologous or substantially homologous polynucleotides encoding PHI-4 polypeptides or related proteins.

The present invention provides isolated or recombinant polynucleotides that encode any of the PHI-4 polypeptides disclosed herein. Those having ordinary skill in the art will readily appreciate that due to the degeneracy of the genetic code, a multitude of nucleotide sequences encoding β-glucosidase polypeptides of the present invention exist. Table 1 is a Codon Table that provides the synonymous codons for each amino acid. For example, the codons AGA, AGG, CGA, CGC, CGG, and CGU all encode the amino acid arginine. Thus, at every position in the nucleic acids of the invention where an arginine is specified by a codon, the codon can be altered to any of the corresponding codons described above without altering the encoded polypeptide. It is understood that U in an RNA sequence corresponds to T in a DNA sequence.

TABLE 1 Alanine Ala A GCA GCC GCG GCU Cysteine Cys C UGC UGU Aspartic acid Asp D GAC GAU Glutamic acid Glu E GAA GAG Phenylalanine Phe F UUC UUU Glycine Gly G GGA GGC GGG GGU Histidine His H CAC CAU Isoleucine Ile I AUA AUC AUU Lysine Lys K AAA AAG Leucine Leu L UUA UUG CUA CUC CUG CUU Methionine Met M AUG Asparagine Asn N AAC AAU Proline Pro P CCA CCC CCG CCU Glutamine Gln Q CAA CAG Arginine Arg R AGA AGG CGA CGC CGG CGU Serine Ser S AGC AGU UCA UCC UCG UCU Threonine Thr T ACA ACC ACG ACU Valine Val V GUA GUC GUG GUU Tryptophan Trp W UGG Tyrosine Tyr Y UAC UAU

Such “silent variations” are one species of “conservative” variation. One of ordinary skill in the art will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine) can be modified by standard techniques to encode a functionally identical polypeptide. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide is implicit in any described sequence. The invention contemplates and provides each and every possible variation of nucleic acid sequence encoding a polypeptide of the invention that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code (set forth in Table 1), as applied to the polynucleotide sequences of the present invention.

A group of two or more different codons that, when translated in the same context, all encode the same amino acid, are referred to herein as “synonymous codons.” Polynucleotides encoding PHI-4 polypeptides of the present disclosure may be codon optimized for expression in a particular host organism by modifying the polynucleotides to conform with the optimum codon usage of the desired host organism. Those having ordinary skill in the art will recognize that tables and other references providing preference information for a wide range of organisms are readily available.

Polynucleotides encoding a PHI-4 polypeptide can also be synthesized de novo from a PHI-4 polypeptide sequence. The sequence of the polynucleotide gene can be deduced from a PHI-4 polypeptide sequence through use of the genetic code. Computer programs such as “BackTranslate” (GCG™ Package, Acclerys, Inc. San Diego, Calif.) can be used to convert a peptide sequence to the corresponding nucleotide sequence encoding the peptide. Examples of PHI-4 polypeptide sequences that can be used to obtain corresponding nucleotide encoding sequences include, but are not limited to, the PHI-4 polypeptide sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NOs: 51-819. Furthermore, synthetic PHI-4 polynucleotide sequences of the invention can be designed so that they will be expressed in plants. U.S. Pat. No. 5,500,365 describes a method for synthesizing plant genes to improve the expression level of the protein encoded by the synthesized gene. This method relates to the modification of the structural gene sequences of the exogenous transgene, to cause them to be more efficiently transcribed, processed, translated and expressed by the plant. Features of genes that are expressed well in plants include elimination of sequences that can cause undesired intron splicing or polyadenylation in the coding region of a gene transcript while retaining substantially the amino acid sequence of the toxic portion of the insecticidal protein. A similar method for obtaining enhanced expression of transgenes in monocotyledonous plants is disclosed in U.S. Pat. No. 5,689,052.

In some embodiments the nucleic acid molecule encoding a PHI-4 polypeptide is a polynucleotide having the sequence set forth in SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID NOS: 24-30 and variants, fragments and complements thereof. By “complement” is intended a nucleic acid sequence that is sufficiently complementary to a given nucleic acid sequence such that it can hybridize to the given nucleic acid sequence to thereby form a stable duplex. In some embodiments the nucleic acid molecule encoding a PHI-4 polypeptide is a nucleic acid molecule having the sequence set forth in SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID NOS: 24-30. The corresponding amino acid sequences for the insecticidal protein encoded by these nucleic acid sequences are set forth in SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID NOS: 24-30.

In some embodiments the nucleic acid molecule encoding a PHI-4 polypeptide is a polynucleotide having a nucleotide sequence encoding a polypeptide comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater sequence identity to the amino acid sequence of SEQ ID NO: 35, wherein the polypeptide has pesticidal activity. In some embodiments the nucleic acid molecule encoding a PHI-4 polypeptide is a polynucleotide having a nucleotide sequence encoding a polypeptide comprising an amino acid sequence having 80% and 95% identity, to the amino acid sequence of SEQ ID NO: 35, wherein the polypeptide has pesticidal activity.

In some embodiments the nucleic acid molecule encoding a PHI-4 polypeptide is a polynucleotide having a nucleotide sequence encoding a polypeptide comprising an amino acid sequence having at least 80% identity, to the amino acid sequence of SEQ ID NO: 51-819, wherein the polypeptide has pesticidal activity.

In some embodiments the nucleic acid molecule encoding a PHI-4 polypeptide is a polynucleotide having a nucleotide sequence encoding a polypeptide comprising an amino acid sequence SEQ ID NO: 4, wherein Xaa at position 9 is Gln, Lys or Glu; Xaa at position 14 is Pro or Ala; Xaa at position 16 is Val or Asp; Xaa at position 19 is Met or Leu; Xaa at position 22 is Gly or Ser; Xaa at position 24 is Asp, Asn or Gln; Xaa at position 36 is Leu or Met; Xaa at position 42 is Asp, Asn or Gln; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu, Asp, Asn or Gly; Xaa at position 50 is Ile or Val; Xaa at position 51 is Glu or Gln; Xaa at position 55 is Arg or Lys; Xaa at position 56 is Ser or Thr; Xaa at position 57 is Tyr or Phe; Xaa at position 58 is Thr or Ser; Xaa at position 61 is Arg, Lys or Glu; Xaa at position 73 is Phe or Tyr; Xaa at position 74 is Lys, Glu, Gly, Arg, Met, Leu, His or Asp; Xaa at position 76 is Asp or Gln; Xaa at position 79 is Lys or Glu; Xaa at position 80 is Glu or Ser; Xaa at position 82 is Glu, Ile, Leu, Tyr or Gln; Xaa at position 83 is Glu or Gln; Xaa at position 84 is Tyr or Phe; Xaa at position 86 is Glu or Gln; Xaa at position 87 is Lys or Gln; Xaa at position 88 is Met, Ile or Leu; Xaa at position 90 is Gln or Glu; Xaa at position 94 is Val or Ile; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln, Gly or Ser; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile, Met, Phe, Cys, Val or Asn; Xaa at position 103 is Ala or Gly; Xaa at position 105 is Leu or Ile; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp, Asn, Glu, Cys, Ala or Arg; Xaa at position 112 is Thr or Ser; Xaa at position 113 is Asp, Glu or Met; Xaa at position 117 is Thr or Ser; Xaa at position 121 is Tyr or Phe; Xaa at position 127 is Ala or Thr; Xaa at position 142 is Arg or Glu; Xaa at position 146 is Arg or Gln; Xaa at position 147 is Arg, Glu or Gln; Xaa at position 148 is Asp, Phe, Pro, Val, Glu, His, Trp, Ala, Arg, Leu, Ser, Gln or Gly; Xaa at position 149 is Phe or Val; Xaa at position 150 is Arg, Gln or Glu; Xaa at position 151 is Asp, Ser, Ala, Asn, Trp, Val, Gln, Cys, Met, Leu, Arg or Glu; Xaa at position 153 is Leu or Ile; Xaa at position 154 is Asn or Asp; Xaa at position 155 is Asn or Lys; Xaa at position 159 is Pro or Asp; Xaa at position 162 is Glu, Asp or Gln; Xaa at position 165 is Lys, Glu, Gln, Pro, Thr, Ala, Leu, Gly, Asp, Val, His, Ile, Met, Trp, Phe, Tyr or Arg; Xaa at position 166 is Arg or Gln; Xaa at position 167 is Tyr, Trp or Cys; Xaa at position 170 is Tyr or His; Xaa at position 171 is Tyr or Phe; Xaa at position 172 is Ile, Leu or Val; Xaa at position 173 is Ser or Ala; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 183 is Tyr or Val; Xaa at position 184 is Ser or Thr; Xaa at position 185 is Ala or Ser; Xaa at position 189 is Thr, Lys or Ile; Xaa at position 191 is Lys or Gln; Xaa at position 193 is Asp or Asn; Xaa at position 196 is Gln, Lys, Asn, Asp, Glu, Ala, Ile or Arg; Xaa at position 202 is Ala or Val; Xaa at position 203 is Glu, Thr or His; Xaa at position 204 is Met or Ala; Xaa at position 206 is Tyr or Phe; Xaa at position 207 is Lys or Gln; Xaa at position 209 is Leu or Pro; Xaa at position 210 is Val or Ile; Xaa at position 214 is Lys, Ser or Gln; Xaa at position 216 is Glu, Gln, Phe, Val, Tyr or Arg; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser, Gln, Arg, Trp, Met, Ala, Phe, Ile, Leu, Cys or Asn; Xaa at position 229 is Arg or Glu; Xaa at position 230 is Ser or Glu; Xaa at position 231 is Asn or Ser; Xaa at position 236 is Leu or Pro; Xaa at position 245 is Met or Leu; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln, Lys or Glu; Xaa at position 257 is Gln, Ile, Glu, Cys, Ser, His, Trp or Met; Xaa at position 261 is Gln, Glu or Lys; Xaa at position 264 is Glu or Gln; Xaa at position 268 is Asp or Asn; Xaa at position 276 is Ser or Ala; Xaa at position 278 is Glu, Asn or Gln; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 282 is Pro or Gly; Xaa at position 284 is Trp or Arg; Xaa at position 287 is Ala or Cys; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr, Asp, Phe, Ser, Met, Arg, Trp, Ile, His, Asn, Cys, Gly or Ala; Xaa at position 291 is Glu or Gln; Xaa at position 292 is Arg or Gln; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 294 is Val or Ala; Xaa at position 296 is Leu or Ile; Xaa at position 297 is Glu or Gln; Xaa at position 298 is Asp or Gln; Xaa at position 300 is Phe or Tyr; Xaa at position 302 is Glu or Gln; Xaa at position 303 is Phe or Tyr; Xaa at position 305 is Lys or Gln; Xaa at position 306 is Gln or Lys; Xaa at position 309 is Gln, Lys or Glu; Xaa at position 313 is Lys, Gln or Arg; Xaa at position 316 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 331 is Glu, Asn or Gln; Xaa at position 333 is Ser, Arg, Gly, Lys, Val, Asn, Ala, His, Gln, Thr, Asp, Ile, Leu, Cys or Glu; Xaa at position 334 is Gly, Arg, Lys, Ile or Trp; Xaa at position 335 is Ser or Ala; Xaa at position 336 is Gly or Ala; Xaa at position 337 is Ala, Val or Gly; Xaa at position 338 is Ser, His, Val, Lys, Ala, Gly, Thr, Ile, Glu, Met, Arg, Pro, Asp, Asn or Leu; Xaa at position 339 is Glu, Asn, Gln, Ile, Pro, Met, Ser, Ala, Cys, Phe, Val, Leu, Asp, Trp, His or Arg; Xaa at position 341 is Leu or Val; Xaa at position 342 is Ala, Ser or Val; Xaa at position 343 is Val or Ile; Xaa at position 344 is Phe or Trp; Xaa at position 345 is Asn or His; Xaa at position 346 is Pro or Ala; Xaa at position 350 is Asn or Ser; Xaa at position 351 is Gly or Val; Xaa at position 354 is Met or Leu; Xaa at position 355 is Val, Ile or Leu; Xaa at position 359 is Gly or Ala; Xaa at position 362 is Asn or Ser; Xaa at position 364 is Ala or Ser; Xaa at position 371 is Ala, Gly or Thr; Xaa at position 374 is Phe or Ile; Xaa at position 375 is Lys or Arg; Xaa at position 380 is Leu or Gly; Xaa at position 382 is Val, Asp or Leu; Xaa at position 383 is Leu, Ile or Val; Xaa at position 384 is Lys, Ala or Gly; Xaa at position 385 is Ala or Gly; Xaa at position 389 is Trp or Tyr; Xaa at position 391 is Arg, Leu, Glu, Gln or Asp; Xaa at position 395 is Asp or Cys; Xaa at position 396 is Ala, Leu, Lys, Asn, Gly, Ile, Met, Arg, Tyr, Gln or His; Xaa at position 397 is Gly, Arg or Ala; Xaa at position 398 is Ser, Gln or Cys; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val, Arg, Ile, Asn, Phe, Thr, Ala, Asp, Met, Gln or Glu; Xaa at position 402 is Lys, Phe, His, Arg, Trp, Gly, Asn, Leu, Tyr, Thr, Val, Met, Pro or Ala; Xaa at position 403 is Asp, Tyr, Trp, Phe or Glu; Xaa at position 405 is Ala or Ser; Xaa at position 409 is Ala or Pro; Xaa at position 410 is Ile or Val; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg, Glu or Gln; Xaa at position 417 is Ala, Ser or Cys; Xaa at position 418 is Leu or Met; Xaa at position 422 is Met or Val; Xaa at position 426 is Thr or Ser; Xaa at position 436 is Asp or Lys; Xaa at position 437 is Tyr or Val; Xaa at position 438 is Val or Arg; Xaa at position 440 is Val or Leu; Xaa at position 442 is Gln, Lys or Glu; Xaa at position 445 is Cys, Leu or Thr; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu, Ile, Gly, Pro, Leu, Phe, Trp or Thr; Xaa at position 448 is Val or Ala; Xaa at position 449 is Gln or Glu; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 453 is Asn or Asp; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp, Gln, Gly, His, Asp, Leu, Thr, Pro or Asn; Xaa at position 455 is Val or Ile; Xaa at position 457 is Trp or Asn; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile, Thr, Ser, His, Cys, Tyr, Pro, Asn, Ala, Arg or Glu; Xaa at position 460 is Gly or Ala; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala, Ser or Gly; Xaa at position 464 is Arg, Gly, His, Gln, Thr or Phe; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly, Arg, Thr, His, Cys, Trp, Phe or Leu; Xaa at position 466 is Asp or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 497 is Asp or Gln; Xaa at position 499 is Glu or Gln; Xaa at position 500 is Arg, Gln or Lys; Xaa at position 502 is Arg, Glu or Gln; Xaa at position 509 is Lys, Gln or Glu; Xaa at position 517 is Gln, Cys, Asn, Val or Pro; Xaa at position 518 is Glu or Gln; Xaa at position 520 is Lys, Gln or Glu; Xaa at position 525 is Gln or Lys; and Xaa at position 527 is Gln, Lys, Pro, Cys, Glu, Ser, His, Phe or Trp; and having one or more amino acid substitutions at positions designated as Xaa in SEQ ID NO: 4 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35.

In some embodiments the nucleic acid molecule encoding a PHI-4 polypeptide is a polynucleotide having a nucleotide sequence encoding a polypeptide comprising an amino acid sequence SEQ ID NO: 3, wherein Xaa at position 24 is Asp or Asn; Xaa at position 42 is Asp or Asn; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu or Asn; Xaa at position 74 is Lys, Glu or Gly; Xaa at position 79 is Lys or Glu; Xaa at position 82 is Glu, Ile, Leu or Tyr; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln or Gly; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile or Met; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp or Asn; Xaa at position 147 is Arg or Glu; Xaa at position 148 is Asp, Phe or Pro; Xaa at position 150 is Arg or Gln; Xaa at position 151 is Asp, Ser, Ala or Asn; Xaa at position 153 is Leu or Ile; Xaa at position 162 is Glu or Gln; Xaa at position 165 is Lys, Glu or Gln; Xaa at position 166 is Arg or Gln; Xaa at position 171 is Tyr or Phe; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 196 is Gln, Lys, Asn or Asp; Xaa at position 203 is Glu, Thr or His; Xaa at position 206 is Tyr or Phe; Xaa at position 216 is Glu or Gln; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser or Gln; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln or Lys; Xaa at position 257 is Gln or Ile; Xaa at position 261 is Gln or Glu; Xaa at position 278 is Glu or Asn; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr or Asp; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 313 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 333 is Ser, Gly, Lys, Val or Asn; Xaa at position 334 is Gly, Arg, Lys or Ile; Xaa at position 336 is Gly or Ala; Xaa at position 338 is Ser, His, Val, Lys or Ala; Xaa at position 339 is Glu, Asn, Ile or Pro; Xaa at position 343 is Val or Ile; Xaa at position 346 is Pro or Ala; Xaa at position 355 is Val or Ile; Xaa at position 359 is Gly or Ala; Xaa at position 391 is Arg, Glu or Gln; Xaa at position 396 is Ala, Leu, Lys, Asn or Gly; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val or Arg; Xaa at position 402 is Lys, Phe, His, Arg, Gly, Trp, Thr, Asn, Tyr or Met; Xaa at position 403 is Asp or Tyr; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg or Glu; Xaa at position 417 is Ala or Ser; Xaa at position 418 is Leu or Met; Xaa at position 426 is Thr or Ser; Xaa at position 440 is Val or Leu; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu or Ile; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp or Gln; Xaa at position 455 is Val or Ile; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile or Tyr; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala or Ser; Xaa at position 464 is Arg, Gly or His; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 500 is Arg or Gln; Xaa at position 509 is Lys or Gln; Xaa at position 520 is Lys, Gln or Glu; and Xaa at position 527 is Gln, Lys, Pro, Cys or Glu; and having one or more amino acid substitutions at positions designated as Xaa in SEQ ID NO: 3 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35.

In some embodiments exemplary nucleic acid molecules encode a PHI-4 of SEQ ID NO: 51-819 as well as amino acid substitutions, amino acid deletions, insertions and fragments thereof and combinations thereof.

In some embodiments the nucleic acid molecules encode a PHI-4 polypeptide of Table 3, Table 4, Table 5, Table 6, Table 7, Megatable 1, and Megatable 2 and combinations of the amino acid substitutions thereof and amino acid deletions and/or insertions thereof.

Also provided are nucleic acid molecules that encode transcription and/or translation products that are subsequently spliced to ultimately produce functional PHI-4. Splicing can be accomplished in vitro or in vivo, and can involve cis- or trans-splicing. The substrate for splicing can be polynucleotides (e.g., RNA transcripts) or polypeptides. An example of cis-splicing of a polynucleotide is where an intron inserted into a coding sequence is removed and the two flanking exon regions are spliced to generate a PHI-4 encoding sequence. An example of trans splicing would be where a polynucleotide is encrypted by separating the coding sequence into two or more fragments that can be separately transcribed and then spliced to form the full-length pesticidal encoding sequence. The use of a splicing enhancer sequence, which can be introduced into a construct, can facilitate splicing either in cis or trans-splicing of polypeptides (U.S. Pat. Nos. 6,365,377 and 6,531,316). Thus, in some embodiments the polynucleotides do not directly encode a full-length PHI-4, but rather encode a fragment or fragments of a PHI-4. These polynucleotides can be used to express a functional PHI-4 through a mechanism involving splicing, where splicing can occur at the level of polynucleotide (e.g., intron/exon) and/or polypeptide (e.g., intein/extein). This can be useful, for example, in controlling expression of pesticidal activity, since functional pesticidal polypeptide will only be expressed if all required fragments are expressed in an environment that permits splicing processes to generate functional product. In another example, introduction of one or more insertion sequences into a polynucleotide can facilitate recombination with a low homology polynucleotide; use of an intron or intein for the insertion sequence facilitates the removal of the intervening sequence, thereby restoring function of the encoded variant.

Nucleic acid molecules that are fragments of these nucleic acid sequences encoding PHI-4 are also encompassed by the embodiments. By “fragment” is intended a portion of the nucleic acid sequence encoding a PHI-4. A fragment of a nucleic acid sequence may encode a biologically active portion of a PHI-4 polypeptide or it may be a fragment that can be used as a hybridization probe or PCR primer using methods disclosed below. Nucleic acid molecules that are fragments of a nucleic acid sequence encoding a PHI-4 comprise at least about 50, 100, 200, 300, 400, 500, 600 or 700, contiguous nucleotides, or up to the number of nucleotides present in a full-length nucleic acid sequence encoding a PHI-4 disclosed herein, depending upon the intended use. By “contiguous” nucleotides is intended nucleotide residues that are immediately adjacent to one another. Fragments of the nucleic acid sequences of the embodiments will encode protein fragments that retain the biological activity of the PHI-4 and, hence, retain insecticidal activity. By “retains activity” is intended that the fragment will encode a polypeptide having at least about 30%, at least about 50%, at least about 70%, 80%, 90%, 95% or higher of the insecticidal activity of the full-length PHI-4. In one embodiment, the insecticidal activity is Lepodoptera activity. In another embodiment, the insecticidal activity is Hemiptera activity.

In some embodiments a fragment of a nucleic acid sequence encoding a PHI-4 encoding a biologically active portion of a protein will encode at least about 15, 25, 30, 50, 75, 100, 125, 150, 175, 200, or 250, contiguous amino acids, or up to the total number of amino acids present in a full-length PHI-4 of the embodiments. In some embodiments, the fragment is an N-terminal or a C-terminal truncation of at least about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25 or more amino acids relative to SEQ ID NO: 35, SEQ ID NOs: 51-819 or variants thereof, e.g., by proteolysis, insertion of a start codon, deletion of the codons encoding the deleted amino acids with the concomitant insertion of a stop codon or by insertion of a stop codon in the coding sequence. In some embodiments, the fragments encompassed herein result from the removal of the N-terminal 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or more amino acids relative to SEQ ID NO: 35, SEQ ID NOs: 51-819 or variants thereof, e.g., by proteolysis or by insertion of a start codon in the coding sequence.

In some embodiments the PHI-4 are encoded by a nucleic acid sequence sufficiently identical to the nucleic acid sequence of SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID NOS: 24-30. By “sufficiently identical” is intended an amino acid or nucleic acid sequence that has at least about 60% or 65% sequence identity, about 70% or 75% sequence identity, about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater sequence identity compared to a reference sequence using one of the alignment programs described herein using standard parameters. In some embodiments the sequence homology identity is against the full length sequence of the polynucleotide encoding a PHI-4 or against the full length sequence of a PHI-4 polypeptide. In some embodiments the polynucleotide encoding the PHI-4 has at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater sequence identity compared to SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID NOS: 24-30. One of skill in the art will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleic acid sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning, and the like.

To determine the percent identity of two amino acid sequences or of two nucleic acids, the sequences are aligned for optimal comparison purposes. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e., percent identity=number of identical positions/total number of positions (e.g., overlapping positions)×100). In one embodiment, the two sequences are the same length. In another embodiment, the comparison is across the entirety of the reference sequence (e.g., across the entirety of SEQ ID NO: 35 or across the entirety of one of SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID NOS: 24-30). The percent identity between two sequences can be determined using techniques similar to those described below, with or without allowing gaps. In calculating percent identity, typically exact matches are counted.

The determination of percent identity between two sequences can be accomplished using a mathematical algorithm. A non-limiting example of a mathematical algorithm utilized for the comparison of two sequences is the algorithm of Karlin and Altschul, (1990) Proc. Natl. Acad. Sci. USA 87:2264, modified as in Karlin and Altschul, (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is incorporated into the BLASTN and BLASTX programs of Altschul, et al., (1990) J. Mol. Biol. 215:403. BLAST nucleotide searches can be performed with the BLASTN program, score=100, wordlength=12, to obtain nucleic acid sequences homologous to pesticidal-like nucleic acid molecules. BLAST protein searches can be performed with the BLASTX program, score=50, wordlength=3, to obtain amino acid sequences homologous to pesticidal protein molecules. To obtain gapped alignments for comparison purposes, Gapped BLAST (in BLAST 2.0) can be utilized as described in Altschul, et al., (1997) Nucleic Acids Res. 25:3389. Alternatively, PSI-Blast can be used to perform an iterated search that detects distant relationships between molecules. See, Altschul, et al., (1997) supra. When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the respective programs (e.g., BLASTX and BLASTN) can be used. Alignment may also be performed manually by inspection.

Another non-limiting example of a mathematical algorithm utilized for the comparison of sequences is the ClustalW algorithm (Higgins, et al., (1994) Nucleic Acids Res. 22:4673-4680). ClustalW compares sequences and aligns the entirety of the amino acid or DNA sequence and thus can provide data about the sequence conservation of the entire amino acid sequence. The ClustalW algorithm is used in several commercially available DNA/amino acid analysis software packages, such as the ALIGNX® module of the Vector NTI® Program Suite (Invitrogen Corporation, Carlsbad, Calif.). After alignment of amino acid sequences with ClustalW, the percent amino acid identity can be assessed. A non-limiting example of a software program useful for analysis of ClustalW alignments is GENEDOC™. GENEDOC™ (Karl Nicholas) allows assessment of amino acid (or DNA) similarity and identity between multiple proteins. Another non-limiting example of a mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers and Miller, (1988) CABIOS 4:11-17. Such an algorithm is incorporated into the ALIGN program (version 2.0), which is part of the GCG Wisconsin Genetics Software Package, Version 10 (available from Accelrys, Inc., 9685 Scranton Rd., San Diego, Calif., USA). When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4 can be used.

Another non-limiting example of a mathematical algorithm utilized for the comparison of sequences is the algorithm of Needleman and Wunsch, (1970) J. Mol. Biol. 48(3):443-453, used GAP Version 10 software to determine sequence identity or similarity using the following default parameters: % identity and % similarity for a nucleic acid sequence using GAP Weight of 50 and Length Weight of 3 and the nwsgapdna.cmpii scoring matrix; % identity or % similarity for an amino acid sequence using GAP weight of 8 and length weight of 2, and the BLOSUM62 scoring program. Equivalent programs may also be used. By “equivalent program” is intended any sequence comparison program that, for any two sequences in question, generates an alignment having identical nucleotide residue matches and an identical percent sequence identity when compared to the corresponding alignment generated by GAP Version 10.

The embodiments also encompass nucleic acid molecules encoding variants of PHI-4 polypeptide. “Variants” of the PHI-4 polypeptide encoding nucleic acid sequences include those sequences that encode the PHI-4 polypeptides disclosed herein but that differ conservatively because of the degeneracy of the genetic code as well as those that are sufficiently identical as discussed above. Naturally occurring allelic variants can be identified with the use of well-known molecular biology techniques, such as polymerase chain reaction (PCR) and hybridization techniques as outlined below. Variant nucleic acid sequences also include synthetically derived nucleic acid sequences that have been generated, for example, by using site-directed mutagenesis but which still encode the PHI-4 polypeptides disclosed as discussed below.

The skilled artisan will further appreciate that changes can be introduced by mutation of the nucleic acid sequences thereby leading to changes in the amino acid sequence of the encoded PHI-4 polypeptides, without altering the biological activity of the proteins. Thus, variant nucleic acid molecules can be created by introducing one or more nucleotide substitutions, nucleotide additions and/or nucleotide deletions into the corresponding nucleic acid sequence disclosed herein, such that one or more amino acid substitutions, amino acid additions or amino acid deletions are introduced into the encoded protein. Mutations can be introduced by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Such variant nucleic acid sequences are also encompassed by the present invention.

Alternatively, variant nucleic acid sequences can be made by introducing mutations randomly along all or part of the coding sequence, such as by saturation mutagenesis and the resultant mutants can be screened for ability to confer pesticidal activity to identify mutants that retain activity. Following mutagenesis, the encoded protein can be expressed recombinantly, and the activity of the protein can be determined using standard assay techniques.

The polynucleotides of the disclosure and fragments thereof are optionally used as substrates for a variety of recombination and recursive recombination reactions, in addition to standard cloning methods as set forth in, e.g., Ausubel, Berger and Sambrook, i.e., to produce additional pesticidal polypeptide homologues and fragments thereof with desired properties. A variety of such reactions are known, including those developed by the inventors and their co-workers. Methods for producing a variant of any nucleic acid listed herein comprising recursively recombining such polynucleotide with a second (or more) polynucleotide, thus forming a library of variant polynucleotides are also embodiments of the disclosure, as are the libraries produced, the cells comprising the libraries, and any recombinant polynucleotide produces by such methods. Additionally, such methods optionally comprise selecting a variant polynucleotide from such libraries based on pesticidal activity, as is wherein such recursive recombination is done in vitro or in vivo.

A variety of diversity generating protocols, including nucleic acid recursive recombination protocols are available and fully described in the art. The procedures can be used separately, and/or in combination to produce one or more variants of a nucleic acid or set of nucleic acids, as well as variants of encoded proteins. Individually and collectively, these procedures provide robust, widely applicable ways of generating diversified nucleic acids and sets of nucleic acids (including, e.g., nucleic acid libraries) useful, e.g., for the engineering or rapid evolution of nucleic acids, proteins, pathways, cells and/or organisms with new and/or improved characteristics.

While distinctions and classifications are made in the course of the ensuing discussion for clarity, it will be appreciated that the techniques are often not mutually exclusive. Indeed, the various methods can be used singly or in combination, in parallel or in series, to access diverse sequence variants.

The result of any of the diversity generating procedures described herein can be the generation of one or more nucleic acids, which can be selected or screened for nucleic acids with or which confer desirable properties or that encode proteins with or which confer desirable properties. Following diversification by one or more of the methods herein, or otherwise available to one of skill, any nucleic acids that are produced can be selected for a desired activity or property, e.g. pesticidal activity, or, such activity at a desired pH, etc. This can include identifying any activity that can be detected, for example, in an automated or automatable format, by any of the assays in the art, see, e.g., discussion of screening of insecticidal activity, infra. A variety of related (or even unrelated) properties can be evaluated, in serial or in parallel, at the discretion of the practitioner.

Descriptions of a variety of diversity generating procedures for generating modified nucleic acid sequences, e.g., those coding for polypeptides having pesticidal activity, or fragments thereof, are found in the following publications and the references cited therein: Soong, et al., (2000) Nat Genet 25(4):436-439; Stemmer, et al., (1999) Tumor Targeting 4:1-4; Ness et al. (1999) Nat Biotechnol 17:893-896; Chang et al. (1999) Nat Biotechnol 17:793-797; Minshull and Stemmer, (1999) Curr Opin Chem Biol 3:284-290; Christians, et al., (1999) Nat Biotechnol 17:259-264; Crameri, et al., (1998) Nature 391:288-291; Crameri, et al., (1997) Nat Biotechnol 15:436-438; Zhang, et al., (1997) PNAS USA 94:4504-4509; Patten, et al., (1997) Curr Opin Biotechnol 8:724-733; Crameri, et al, (1996) Nat Med 2:100-103; Crameri, et al., (1996) Nat Biotechnol 14:315-319; Gates, et al., (1996) J Mol Biol 255:373-386; Stemmer, (1996) “Sexual PCR and Assembly PCR” In: The Encyclopedia of Molecular Biology. VCH Publishers, New York. pp. 447-457; Crameri and Stemmer, (1995) Bio Techniques 18:194-195; Stemmer, et al., (1995) Gene, 164:49-53; Stemmer, (1995) Science 270:1510; Stemmer, (1995) Bio/Technology 13:549-553; Stemmer, (1994) Nature 370:389-391 and Stemmer, (1994) PNAS USA 91:10747-10751.

Mutational methods of generating diversity include, for example, site-directed mutagenesis (Ling, et al., (1997) Anal Biochem 254(2):157-178; Dale, et al., (1996) Methods Mol Biol 57:369-374; Smith, (1985) Ann Rev Genet 19:423-462; Botstein and Shortle, (1985) Science 229:1193-1201; Carter, (1986) Biochem J 237:1-7 and Kunkel, (1987) “The efficiency of oligonucleotide directed mutagenesis” in Nucleic Acids & Molecular Biology (Eckstein and Lilley, eds., Springer Verlag, Berlin)); mutagenesis using uracil containing templates (Kunkel, (1985) PNAS USA 82:488-492; Kunkel, et al, (1987) Methods Enzymol 154:367-382 and Bass, et al., (1988) Science 242:240-245); oligonucleotide-directed mutagenesis (Zoller and Smith, (1983) Methods Enzymol 100:468-500; Zoller and Smith, (1987) Methods Enzymol 154:329-350; Zoller and Smith, (1982) Nucleic Acids Res 10:6487-6500), phosphorothioate-modified DNA mutagenesis (Taylor, et al, (1985) Nucl Acids Res 13:8749-8764; Taylor, et al, (1985) Nucl Acids Res 13:8765-8787 (1985); Nakamaye and Eckstein (1986) Nucl Acids Res 14:9679-9698; Sayers, et al., (1988) Nucl Acids Res 16:791-802 and Sayers, et al., (1988) Nucl Acids Res 16: 803-814); mutagenesis using gapped duplex DNA (Kramer, et al., (1984) Nucl Acids Res 12:9441-9456; Kramer and Fritz, (1987) Methods Enzymol 154:350-367; Kramer, et al., (1988) Nucl Acids Res 16:7207 and Fritz, et al., (1988) Nucl Acids Res 16:6987-6999).

Additional suitable methods include point mismatch repair (Kramer, et al., (1984) Cell 38:879-887), mutagenesis using repair-deficient host strains (Carter, et al., (1985) Nucl Acids Res 13:4431-4443 and Carter, (1987) Methods in Enzymol 154:382-403), deletion mutagenesis (Eghtedarzadeh and Henikoff, (1986) Nucl Acids Res 14:5115), restriction-selection and restriction-purification (Wells, et al., (1986) Phil Trans R Soc Lond A 317:415-423), mutagenesis by total gene synthesis (Nambiar, et al., (1984) Science 223:1299-1301; Sakamar and Khorana, (1988) Nucl Acids Res 14:6361-6372; Wells, et al., (1985) Gene 34:315-323 and Grundström, et al., (1985) Nucl Acids Res 13:3305-3316), double-strand break repair (Mandecki, (1986) PNAS USA, 83:7177-7181 and Arnold, (1993) Curr Opin Biotech 4:450-455). Additional details on many of the above methods can be found in Methods Enzymol Volume 154, which also describes useful controls for trouble-shooting problems with various mutagenesis methods.

Additional details regarding various diversity generating methods can be found in the following US Patents, PCT Publications and Applications and EPO Publications: U.S. Pat. Nos. 5,723,323, 5,763,192, 5,814,476, 5,817,483, 5,824,514, 5,976,862, 5,605,793, 5,811,238, 5,830,721, 5,834,252, 5,837,458, WO 1995/22625, WO 1996/33207, WO 1997/20078, WO 1997/35966, WO 1999/41402, WO 1999/41383, WO 1999/41369, WO 1999/41368, EP 752008, EP 0932670, WO 1999/23107, WO 1999/21979, WO 1998/31837, WO 1998/27230, WO 1998/27230, WO 2000/00632, WO 2000/09679, WO 1998/42832, WO 1999/29902, WO 1998/41653, WO 1998/41622, WO 1998/42727, WO 2000/18906, WO 2000/04190, WO 2000/42561, WO 2000/42559, WO 2000/42560, WO 2001/23401, and PCT/US01/06775.

The nucleotide sequences of the embodiments can also be used to isolate corresponding sequences from other organisms, particularly other bacteria. In this manner, methods such as PCR, hybridization and the like can be used to identify such sequences based on their sequence homology to the sequences set forth herein. Sequences that are selected based on their sequence identity to the entire sequences set forth herein or to fragments thereof are encompassed by the embodiments. Such sequences include sequences that are orthologs of the disclosed sequences. The term “orthologs” refers to genes derived from a common ancestral gene and which are found in different species as a result of speciation. Genes found in different species are considered orthologs when their nucleotide sequences and/or their encoded protein sequences share substantial identity as defined elsewhere herein. Functions of orthologs are often highly conserved among species.

In a PCR approach, oligonucleotide primers can be designed for use in PCR reactions to amplify corresponding DNA sequences from cDNA or genomic DNA extracted from any organism of interest. Methods for designing PCR primers and PCR cloning are generally known in the art and are disclosed in Sambrook, et al., (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.), hereinafter “Sambrook”. See also, Innis, et al., eds. (1990) PCR Protocols: A Guide to Methods and Applications (Academic Press, New York); Innis and Gelfand, eds. (1995) PCR Strategies (Academic Press, New York); and Innis and Gelfand, eds. (1999) PCR Methods Manual (Academic Press, New York). Known methods of PCR include, but are not limited to, methods using paired primers, nested primers, single specific primers, degenerate primers, gene-specific primers, vector-specific primers, partially-mismatched primers, and the like.

To identify potential PHI-4 polypeptides from bacterial collections, the bacterial cell lysates can be screened with antibodies generated against a PHI-4 polypeptide using Western blotting and/or ELISA methods. This type of assays can be performed in a high throughput fashion. Positive samples can be further analyzed by various techniques such as antibody based protein purification and identification. Methods of generating antibodies are well known in the art as discussed infra.

Alternatively, mass spectrometry based protein identification method can be used to identify homologs of a PHI-4 polypeptide using protocols in the literatures (Patterson, (1998), 10(22):1-24, Current Protocol in Molecular Biology published by John Wiley & Son Inc). Specifically, LC-MS/MS based protein identification method is used to associate the MS data of given cell lysate or desired molecular weight enriched samples (excised from SDS-PAGE gel of relevant molecular weight bands to a PHI-4 polypeptide) with sequence information of a PHI-4 polypeptide and its homologs. Any match in peptide sequences indicates the potential of having the homologous proteins in the samples. Additional techniques (protein purification and molecular biology) can be used to isolate the protein and identify the sequences of the homologs.

In hybridization methods, all or part of the pesticidal nucleic acid sequence can be used to screen cDNA or genomic libraries. Methods for construction of such cDNA and genomic libraries are generally known in the art and are disclosed in Sambrook and Russell, (2001), supra. The so-called hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments or other oligonucleotides, and may be labeled with a detectable group such as 32P, or any other detectable marker, such as other radioisotopes, a fluorescent compound, an enzyme, or an enzyme co-factor. Probes for hybridization can be made by labeling synthetic oligonucleotides based on the known PHI-4 polypeptide-encoding nucleic acid sequence disclosed herein. Degenerate primers designed on the basis of conserved nucleotides or amino acid residues in the nucleic acid sequence or encoded amino acid sequence can additionally be used. The probe typically comprises a region of nucleic acid sequence that hybridizes under stringent conditions to at least about 12, at least about 25, at least about 50, 75, 100, 125, 150, 175 or 200 consecutive nucleotides of nucleic acid sequence encoding a PHI-4 polypeptide of the disclosure or a fragment or variant thereof. Methods for the preparation of probes for hybridization are generally known in the art and are disclosed in Sambrook and Russell, (2001), supra, herein incorporated by reference.

For example, an entire nucleic acid sequence, encoding a PHI-4 polypeptide, disclosed herein, or one or more portions thereof, may be used as a probe capable of specifically hybridizing to corresponding nucleic acid sequences encoding PHI-4 polypeptide-like sequences and messenger RNAs. To achieve specific hybridization under a variety of conditions, such probes include sequences that are unique and are preferably at least about 10 nucleotides in length, or at least about 20 nucleotides in length. Such probes may be used to amplify corresponding pesticidal sequences from a chosen organism by PCR. This technique may be used to isolate additional coding sequences from a desired organism or as a diagnostic assay to determine the presence of coding sequences in an organism. Hybridization techniques include hybridization screening of plated DNA libraries (either plaques or colonies; see, for example, Sambrook, et al., (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.).

Hybridization of such sequences may be carried out under stringent conditions. By “stringent conditions” or “stringent hybridization conditions” is intended conditions under which a probe will hybridize to its target sequence to a detectably greater degree than to other sequences (e.g., at least 2-fold over background). Stringent conditions are sequence-dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences that are 100% complementary to the probe can be identified (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing). Generally, a probe is less than about 1000 nucleotides in length, preferably less than 500 nucleotides in length.

Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium dodecyl sulphate) at 37° C., and a wash in 1× to 2×SSC (20×SSC=3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55° C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1.0 M NaCl, 1% SDS at 37° C., and a wash in 0.5× to 1×SSC at 55 to 60° C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37° C., and a wash in 0.1×SSC at 60 to 65° C. Optionally, wash buffers may comprise about 0.1% to about 1% SDS. Duration of hybridization is generally less than about 24 hours, usually about 4 to about 12 hours.

Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the Tm can be approximated from the equation of Meinkoth and Wahl, (1984) Anal. Biochem. 138:267-284: Tm=81.5° C.+16.6 (log M)+0.41 (% GC)−0.61 (% form)−500/L; where M is the molarity of monovalent cations, % GC is the percentage of guanosine and cytosine nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs. The Tm is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe. Tm is reduced by about 1° C. for each 1% of mismatching; thus, Tm, hybridization, and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with 90% identity are sought, the Tm can be decreased 10° C. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence and its complement at a defined ionic strength and pH. However, severely stringent conditions can utilize a hybridization and/or wash at 1, 2, 3 or 4° C. lower than the thermal melting point (Tm); moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9 or 10° C. lower than the thermal melting point (Tm); low stringency conditions can utilize a hybridization and/or wash at 11, 12, 13, 14, 15, or 20° C. lower than the thermal melting point (Tm). Using the equation, hybridization and wash compositions, and desired Tm, those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a Tm of less than 45° C. (aqueous solution) or 32° C. (formamide solution), it is preferred to increase the SSC concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen, (1993) Laboratory Techniques in Biochemistry and Molecular Biology—Hybridization with Nucleic Acid Probes, Part I, Chapter 2 (Elsevier, N.Y.); and Ausubel, et al., eds. (1995) Current Protocols in Molecular Biology, Chapter 2 (Greene Publishing and Wiley-Interscience, New York). See, Sambrook, et al., (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.).

Proteins and Variants and Fragments Thereof

PHI-4 polypeptides are encompassed by the disclosure. By “PHI-4 polypeptide” or “PHI-4 protein” as used herein interchangeably is intended a polypeptide that has increased insecticidal activity against one or more insect pests of the Lepidoptera and/or Coleoptera orders compared to the protein of SEQ ID NO: 35, and is sufficiently identical to the protein of SEQ ID NO: 35. A variety of PHI-4 polypeptides are contemplated.

The Western Corn Rootworm active protein AXMI-205 (SEQ ID NO: 35) encoded by the polynucleotide of SEQ ID NO: 34 was identified from the Chromobacterium Strain ATX 2024 (US20110023184). Synthetic genes encoding AXMI-205; AXMI-205 variants having a truncation of the last 10 and 20 amino acids from the C-terminus (SEQ ID NO: 36 and SEQ ID NO: 37); and alanine scanning at every other residue from residue 307-536 of AXMI-205 (SEQ ID NO: 35), with 5307A, D315A, V317A, S349A, G351A, K353A, V355A, D395A, G399A, W407A, G419A, P435A, S443A, K465A, V467A, F483A, P487A, S495A, D497A, E499A, K509A, and 1513A identified as having WCRW activity; are disclosed in US20110023184. AXMI-205 variants evo 24 (E499A); evo25 (V467A—SEQ ID NO: 41); evo30 (V467L—SEQ ID NO: 42); PMlibl PoolIG2_p2al 1 (S468L—SEQ ID NO: 45); PMlibl PoolIG2_plcl (V467T—SEQ ID NO: 46), PMlibl PoolIG2_pla4 (R464N—SEQ ID NO: 47), evo34 (E86T—SEQ ID NO: 43) and evo35 (Q517R—SEQ ID NO: 44) having increased insecticidal activity are disclosed in WO2013/016617.

As used herein, the terms “protein,” “peptide molecule” or “polypeptide” includes any molecule that comprises five or more amino acids. It is well known in the art that protein, peptide or polypeptide molecules may undergo modification, including post-translational modifications, such as, but not limited to, disulfide bond formation, glycosylation, phosphorylation or oligomerization. Thus, as used herein, the terms “protein,” “peptide molecule” or “polypeptide” includes any protein that is modified by any biological or non-biological process. The terms “amino acid” and “amino acids” refer to all naturally occurring L-amino acids.

A “recombinant protein” is used to refer to a protein that is no longer in its natural environment, for example in vitro or in a recombinant bacterial or plant host cell. A PHI-4 polypeptide that is substantially free of cellular material includes preparations of protein having less than about 30%, 20%, 10% or 5% (by dry weight) of non-pesticidal protein (also referred to herein as a “contaminating protein”).

By “improved activity” or “increased activity” is intended an increase of at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 110%, at least about 120%, at least about 130%, at least about 140%, at least about 150%, at least about 160%, at least about 170%, at least about 180%, at least about 190%, at least about 200%, at least about 210% at least about 220%, at least about 230%, at least about 240%, at least about 250%, at least about 260%, at least about 270%, at least about 280%, at least about 290%, at least about 300%, at least about 310%, at least about 320%, at least about 330%, at least about 340%, at least about 350%, at least about 360%, at least about 370%, at least about 380%, at least about 390%, at least about 400%, at least about 410%, at least about 420%, at least about 430%, at least about 440%, at least about 450%, at least about 460%, at least about 470%, at least about 480%, at least about 490%, at least about 500%, at least about 510%, at least about 520%, at least about 530%, at least about 540%, at least about 550%, at least about 560%, at least about 570%, at least about 580%, at least about 590%, at least about 600%, at least about 650%, at least about 700%, at least about 750%, at least about 800%, at least about 850%, at least about 900%, at least about 950%, at least about 1000% or higher, or at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, or at least about 1.5-fold, at least about 1.6-fold, at least about 1.7-fold, at least about 1.8-fold, at least about 1.9-fold, at least about 2-fold, at least about 2.1-fold, at least about 2.2-fold, at least about 2.3-fold, at least about 2.4-fold, at least about 2.5-fold, at least about 2.6-fold, at least about 2.7-fold, at least about 2.8-fold, at least about 2.9-fold, at least about 3-fold, at least about 3.1-fold, at least about 3.2-fold, at least about 3.3-fold, at least about 3.4-fold, at least about 3.5-fold, at least about 3.6-fold, at least about 3.7-fold, at least about 3.8-fold, at least about 3.9-fold, at least about 4-fold, at least about 4.1-fold, at least about 4.2-fold, at least about 4.3-fold, at least about 4.4-fold, at least about 4.5-fold, at least about 4.6-fold, at least about 4.7-fold, at least about 4.8-fold, at least about 4.9-fold, at least about 5-fold, at least about 5.1-fold, at least about 5.2-fold, at least about 5.3-fold, at least about 5.4-fold, at least about 5.5-fold, at least about 5.6-fold, at least about 5.7-fold, at least about 5.8-fold, at least about 5.9-fold, at least about 6-fold, at least about 6.1-fold, at least about 6.2-fold, at least about 6.3-fold, at least about 6.4-fold, at least about 6.5-fold, at least about 6.6-fold, at least about 6.7-fold, at least about 6.8-fold, at least about 6.9-fold, at least about 7-fold, at least about 7.1-fold, at least about 7.2-fold, at least about 7.3-fold, at least about 7.4-fold, at least about 7.5-fold, at least about 7.6-fold, at least about 7.7-fold, at least about 7.8-fold, at least about 7.9-fold, at least about 8-fold, at least about 8.1-fold, at least about 8.2-fold, at least about 8.3-fold, at least about 8.4-fold, at least about 8.5-fold, at least about 8.6-fold, at least about 8.7-fold, at least about 8.8-fold, at least about 8.9-fold, at least about 9-fold, at least about 9.1-fold, at least about 9.2-fold, at least about 9.3-fold, at least about 9.4-fold, at least about 9.5-fold, at least about 9.6-fold, at least about 9.7-fold, at least about 9.8-fold, at least about 9.9-fold, at least about 10-fold, or higher increase in the pesticidal activity of the variant protein relative to the pesticidal activity of AXMI-205 (SEQ ID NO: 35).

In some embodiments, the improvement consists of a decrease in the EC50 of at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 110%, at least about 120%, at least about 130%, at least about 140%, at least about 150%, at least about 160%, at least about 170%, at least about 180%, at least about 190%, at least about 200%, at least about 210% at least about 220%, at least about 230%, at least about 240%, at least about 250%, at least about 260%, at least about 270%, at least about 280%, at least about 290%, at least about 300%, at least about 310%, at least about 320%, at least about 330%, at least about 340%, at least about 350%, at least about 360%, at least about 370%, at least about 380%, at least about 390%, at least about 400%, at least about 410%, at least about 420%, at least about 430%, at least about 440%, at least about 450%, at least about 460%, at least about 470%, at least about 480%, at least about 490%, at least about 500%, at least about 510%, at least about 520%, at least about 530%, at least about 540%, at least about 550%, at least about 560%, at least about 570%, at least about 580%, at least about 590%, at least about 600%, at least about 650%, at least about 700%, at least about 750%, at least about 800%, at least about 850%, at least about 900%, at least about 950%, at least about 1000% or higher, or at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, or at least about 1.5-fold, at least about 1.6-fold, at least about 1.7-fold, at least about 1.8-fold, at least about 1.9-fold, at least about 2-fold, at least about 2.1-fold, at least about 2.2-fold, at least about 2.3-fold, at least about 2.4-fold, at least about 2.5-fold, at least about 2.6-fold, at least about 2.7-fold, at least about 2.8-fold, at least about 2.9-fold, at least about 3-fold, at least about 3.1-fold, at least about 3.2-fold, at least about 3.3-fold, at least about 3.4-fold, at least about 3.5-fold, at least about 3.6-fold, at least about 3.7-fold, at least about 3.8-fold, at least about 3.9-fold, at least about 4-fold, at least about 4.1-fold, at least about 4.2-fold, at least about 4.3-fold, at least about 4.4-fold, at least about 4.5-fold, at least about 4.6-fold, at least about 4.7-fold, at least about 4.8-fold, at least about 4.9-fold, at least about 5-fold, at least about 5.1-fold, at least about 5.2-fold, at least about 5.3-fold, at least about 5.4-fold, at least about 5.5-fold, at least about 5.6-fold, at least about 5.7-fold, at least about 5.8-fold, at least about 5.9-fold, at least about 6-fold, at least about 6.1-fold, at least about 6.2-fold, at least about 6.3-fold, at least about 6.4-fold, at least about 6.5-fold, at least about 6.6-fold, at least about 6.7-fold, at least about 6.8-fold, at least about 6.9-fold, at least about 7-fold, at least about 7.1-fold, at least about 7.2-fold, at least about 7.3-fold, at least about 7.4-fold, at least about 7.5-fold, at least about 7.6-fold, at least about 7.7-fold, at least about 7.8-fold, at least about 7.9-fold, at least about 8-fold, at least about 8.1-fold, at least about 8.2-fold, at least about 8.3-fold, at least about 8.4-fold, at least about 8.5-fold, at least about 8.6-fold, at least about 8.7-fold, at least about 8.8-fold, at least about 8.9-fold, at least about 9-fold, at least about 9.1-fold, at least about 9.2-fold, at least about 9.3-fold, at least about 9.4-fold, at least about 9.5-fold, at least about 9.6-fold, at least about 9.7-fold, at least about 9.8-fold, at least about 9.9-fold, at least about 10-fold, or greater reduction in the EC50 of the PHI-4 polypeptide relative to the pesticidal activity of AXMI-205 (SEQ ID NO: 35).

In some embodiments the EC50 of the PHI-4 polypeptide is <100 ppm, <90 ppm, <80 ppm, <70 ppm, <60 ppm, <50 ppm, <45 ppm, <40 ppm, <35 ppm, <30 ppm, <25 ppm, <20 ppm, <19 ppm, <18 ppm, <17 ppm, <16 ppm, <15 ppm, <14 ppm, <13 ppm, <12 ppm, <11 ppm, <10 ppm, <9 ppm, <8 ppm, <7 ppm, <6 ppm, <5 ppm, <4 ppm, <3 ppm, <2 ppm, <1 ppm, <0.9 ppm, <0.8 ppm, <0.7 ppm, <0.6 ppm, <0.5 ppm, <0.4 ppm, <0.3 ppm, <0.2 ppm, <0.1 ppm,

In some embodiments, the improvement consists of an increase in the Mean FAE Index of at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 110%, at least about 120%, at least about 130%, at least about 140%, at least about 150%, at least about 160%, at least about 170%, at least about 180%, at least about 190%, at least about 200%, at least about 210% at least about 220%, at least about 230%, at least about 240%, at least about 250%, at least about 260%, at least about 270%, at least about 280%, at least about 290%, at least about 300%, at least about 310%, at least about 320%, at least about 330%, at least about 340%, at least about 350%, at least about 360%, at least about 370%, at least about 380%, at least about 390%, at least about 400%, at least about 410%, at least about 420%, at least about 430%, at least about 440%, at least about 450%, at least about 460%, at least about 470%, at least about 480%, at least about 490%, at least about 500%, at least about 510%, at least about 520%, at least about 530%, at least about 540%, at least about 550%, at least about 560%, at least about 570%, at least about 580%, at least about 590%, at least about 600%, at least about 650%, at least about 700%, at least about 750%, at least about 800%, at least about 850%, at least about 900%, at least about 950%, at least about 1000% or higher, or at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, or at least about 1.5-fold, at least about 1.6-fold, at least about 1.7-fold, at least about 1.8-fold, at least about 1.9-fold, at least about 2-fold, at least about 2.1-fold, at least about 2.2-fold, at least about 2.3-fold, at least about 2.4-fold, at least about 2.5-fold, at least about 2.6-fold, at least about 2.7-fold, at least about 2.8-fold, at least about 2.9-fold, at least about 3-fold, at least about 3.1-fold, at least about 3.2-fold, at least about 3.3-fold, at least about 3.4-fold, at least about 3.5-fold, at least about 3.6-fold, at least about 3.7-fold, at least about 3.8-fold, at least about 3.9-fold, at least about 4-fold, at least about 4.1-fold, at least about 4.2-fold, at least about 4.3-fold, at least about 4.4-fold, at least about 4.5-fold, at least about 4.6-fold, at least about 4.7-fold, at least about 4.8-fold, at least about 4.9-fold, at least about 5-fold, at least about 5.1-fold, at least about 5.2-fold, at least about 5.3-fold, at least about 5.4-fold, at least about 5.5-fold, at least about 5.6-fold, at least about 5.7-fold, at least about 5.8-fold, at least about 5.9-fold, at least about 6-fold, at least about 6.1-fold, at least about 6.2-fold, at least about 6.3-fold, at least about 6.4-fold, at least about 6.5-fold, at least about 6.6-fold, at least about 6.7-fold, at least about 6.8-fold, at least about 6.9-fold, at least about 7-fold, at least about 7.1-fold, at least about 7.2-fold, at least about 7.3-fold, at least about 7.4-fold, at least about 7.5-fold, at least about 7.6-fold, at least about 7.7-fold, at least about 7.8-fold, at least about 7.9-fold, at least about 8-fold, at least about 8.1-fold, at least about 8.2-fold, at least about 8.3-fold, at least about 8.4-fold, at least about 8.5-fold, at least about 8.6-fold, at least about 8.7-fold, at least about 8.8-fold, at least about 8.9-fold, at least about 9-fold, at least about 9.1-fold, at least about 9.2-fold, at least about 9.3-fold, at least about 9.4-fold, at least about 9.5-fold, at least about 9.6-fold, at least about 9.7-fold, at least about 9.8-fold, at least about 9.9-fold, at least about 10-fold, or higher increase in the Mean FAE Index of the PHI-4 polypeptide relative to the pesticidal activity of AXMI-205 (SEQ ID NO: 35).

In some embodiments, the improvement consists of an increase in the Mean Deviation Score of at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 110%, at least about 120%, at least about 130%, at least about 140%, at least about 150%, at least about 160%, at least about 170%, at least about 180%, at least about 190%, at least about 200%, at least about 210% at least about 220%, at least about 230%, at least about 240%, at least about 250%, at least about 260%, at least about 270%, at least about 280%, at least about 290%, at least about 300%, at least about 310%, at least about 320%, at least about 330%, at least about 340%, at least about 350%, at least about 360%, at least about 370%, at least about 380%, at least about 390%, at least about 400%, at least about 410%, at least about 420%, at least about 430%, at least about 440%, at least about 450%, at least about 460%, at least about 470%, at least about 480%, at least about 490%, at least about 500%, at least about 510%, at least about 520%, at least about 530%, at least about 540%, at least about 550%, at least about 560%, at least about 570%, at least about 580%, at least about 590%, at least about 600%, at least about 650%, at least about 700%, at least about 750%, at least about 800%, at least about 850%, at least about 900%, at least about 950%, at least about 1000% or higher, or at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, or at least about 1.5-fold, at least about 1.6-fold, at least about 1.7-fold, at least about 1.8-fold, at least about 1.9-fold, at least about 2-fold, at least about 2.1-fold, at least about 2.2-fold, at least about 2.3-fold, at least about 2.4-fold, at least about 2.5-fold, at least about 2.6-fold, at least about 2.7-fold, at least about 2.8-fold, at least about 2.9-fold, at least about 3-fold, at least about 3.1-fold, at least about 3.2-fold, at least about 3.3-fold, at least about 3.4-fold, at least about 3.5-fold, at least about 3.6-fold, at least about 3.7-fold, at least about 3.8-fold, at least about 3.9-fold, at least about 4-fold, at least about 4.1-fold, at least about 4.2-fold, at least about 4.3-fold, at least about 4.4-fold, at least about 4.5-fold, at least about 4.6-fold, at least about 4.7-fold, at least about 4.8-fold, at least about 4.9-fold, at least about 5-fold, at least about 5.1-fold, at least about 5.2-fold, at least about 5.3-fold, at least about 5.4-fold, at least about 5.5-fold, at least about 5.6-fold, at least about 5.7-fold, at least about 5.8-fold, at least about 5.9-fold, at least about 6-fold, at least about 6.1-fold, at least about 6.2-fold, at least about 6.3-fold, at least about 6.4-fold, at least about 6.5-fold, at least about 6.6-fold, at least about 6.7-fold, at least about 6.8-fold, at least about 6.9-fold, at least about 7-fold, at least about 7.1-fold, at least about 7.2-fold, at least about 7.3-fold, at least about 7.4-fold, at least about 7.5-fold, at least about 7.6-fold, at least about 7.7-fold, at least about 7.8-fold, at least about 7.9-fold, at least about 8-fold, at least about 8.1-fold, at least about 8.2-fold, at least about 8.3-fold, at least about 8.4-fold, at least about 8.5-fold, at least about 8.6-fold, at least about 8.7-fold, at least about 8.8-fold, at least about 8.9-fold, at least about 9-fold, at least about 9.1-fold, at least about 9.2-fold, at least about 9.3-fold, at least about 9.4-fold, at least about 9.5-fold, at least about 9.6-fold, at least about 9.7-fold, at least about 9.8-fold, at least about 9.9-fold, at least about 10-fold, or higher increase in the Mean Deviation Score of the PHI-4 polypeptide relative to the pesticidal activity of AXMI-205 (SEQ ID NO: 35).

In some embodiments the improved activity of the PHI-4 polypeptide is relative to the pesticidal activity of AXMI-205(evo25) (SEQ ID NO: 41), AXMI-205(evo30) (SEQ ID NO: 42), Axmi205 PMlibl PoolIG2_p2al 1 (mutation S468L; SEQ ID NO: 45), Axmi205 PMlibl PoolIG2_plcl (mutation V467T; SEQ ID NO: 46), Axmi205 PMlibl PoolIG2_pla4 (mutation R464N; SEQ ID NO: 47), AXMI-205(evo34) (SEQ ID NO: 43) or AXMI-205(evo35) (SEQ ID NO: 44).

“Mean FAE Index” (MFI) refers to the mean of multiple FAEGN an arithmetic mean of FAEGN. As used herein, the “Mean Deviation Score” refers to the arithmetic mean of multiple Deviation Scores.

In some embodiments the PHI-4 polypeptides have increased insecticidal activity against one or more insect pests of the order Coleoptera.

In some embodiments the PHI-4 polypeptides have increased insecticidal activity against Diabrotica virgifera larvae—Western Corn Root Worm (WCRW).

“Fragments” or “biologically active portions” include polypeptide fragments comprising amino acid sequences sufficiently identical to a PHI-4 polypeptide and that exhibit insecticidal activity. “Fragments” or “biologically active portions” include polypeptide fragments comprising amino acid sequences sufficiently identical to the amino acid sequence set forth in SEQ ID NO: 35 and SEQ ID NOs: 51-819 and that exhibit insecticidal activity. A biologically active portion of a PHI-4 polypeptide can be a polypeptide that is, for example, 10, 25, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534 or 535 amino acids in length. Such biologically active portions can be prepared by recombinant techniques and evaluated for insecticidal activity. As used here, a fragment comprises at least 8 contiguous amino acids of a PHI-4 polypeptide. In some embodiments a fragment comprises at least 8 contiguous amino acids of SEQ ID NO: 2 or SEQ ID NOs: 51-819. In some embodiments a fragment comprises at least 8 contiguous amino acids of SEQ ID NO: 2 or SEQ ID NOs: 51-819. The embodiments encompass other fragments, however, such as any fragment in the protein greater than about 10, 20, 30, 50, 100, 150, 200, 250 or more amino acids.

In some embodiments, the fragment is an N-terminal and/or a C-terminal truncation of at least about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25 or more amino acids relative to SEQ ID NO: 2, SEQ ID NO: 35 or SEQ ID NOs: 51-819 or variants thereof e.g., by proteolysis, by insertion of a start codon, by deletion of the codons encoding the deleted amino acids and concomitant insertion of a start codon and/or insertion of a stop codon. In some embodiments, the fragments encompassed herein result from the removal of the C-terminal 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, or 29 amino acids relative to SEQ ID NO: 35, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819, and variants thereof by proteolysis or by insertion of a start codon, by deletion of the codons encoding the deleted amino acids and concomitant insertion of a start codon. In particular embodiments the proteolytic cleavage site is between Lys at 520 and Ser at 521 Ser or Lys at 313 and Val at 314 of SEQ ID NO: 35 or variants thereof. It is well known in the art that polynucleotide encoding the truncated PHI-4 polypeptide can be engineered to add a start codon at the N-terminus such as ATG encoding methionine or methionine followed by an alanine. It is also well known in the art that depending on what host the PHI-4 polypeptide is expressed in the methionine may be partially of completed processed off.

In some embodiments fragments, biologically active portions, of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819 as well as amino acid substitutions, amino acid deletions and/or insertions thereof are also provided, and may be used to practice the methods of the disclosure.

In some embodiments PHI-4 polypeptides are provided having one or more amino acid substitution compared to AXMI-205 (SEQ ID NO: 35). In some embodiments PHI-4 polypeptides are provided having amino acid substitutions at solvent exposed surface residues to modify the protein characteristics of AXMI-205 (SEQ ID NO: 35), including but not limited to the ionic polarity of the protein surface. In some embodiments PHI-4 polypeptides are provided having amino acid substitutions at hydrophilic residues such as Asp, Glu, Lys, Arg, His, Ser, Thr, Tyr, Trp, Asn, Gln, and Cys. In some embodiments the PHI-4 polypeptides are provided having amino acid substitutions changing a Lysine or Arginine to a Glutamine, Glutamic Acid, Asparagine or Glutamic Acid; changing a Glutamic Acid or Aspartic Acid to a Lysine, Asparagine or Glutamine; and changing a Glutamine to a Asparagine or Lysine.

In some embodiments PHI-4 polypeptides are provided having amino acid substitutions at residues in a membrane insertion loop. In some embodiments PHI-4 polypeptides are provided having amino acid substitutions in a membrane insertion loop between about amino acid at position 92 (Val) and 101 (Ala) and/or at position 211 (Gly) and 220 (Glu) relative to SEQ ID NO: 35.

In some embodiments PHI-4 polypeptides are provided having amino acid substitutions at residues and receptor binding loops. In some embodiments PHI-4 polypeptides are provided having amino acid substitutions at residues and receptor binding loops between about amino acid 332 (Asp) and 340 (Asp), 395 (Asp) and 403 (Asp), 458 (Asp) and 466 (Asp) relative to SEQ ID NO: 35.

In some embodiments PHI-4 polypeptides are provided having amino acid substitutions at residues in a protease sensitive region. In some embodiments PHI-4 polypeptides are provided having amino acid substitutions at residues in a protease sensitive region from about amino acid residues between 305 (Lys) and 316 (Lys) and 500 (Arg) and 535 (Lys) relative to SEQ ID NO: 35.

By variants is intended proteins or polypeptides having an amino acid sequence that is at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to the parental amino acid sequence. In some embodiments a PHI-4 polypeptide has at least about 60%, 65%, about 70%, 75%, at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater identity across the entire length of the amino acid sequence of SEQ ID NO: 2 or SEQ ID NOs: 51-819. In some embodiments a PHI-4 polypeptide has at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater identity across the entire length of the amino acid sequence of SEQ ID NO: 2 or SEQ ID NOs: 51-819.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence having at least 80% identity, to the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819, wherein the polypeptide has insecticidal activity.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence having at least 90% identity to the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819, wherein the polypeptide has insecticidal activity.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence having at least 95% identity to the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819 wherein the polypeptide has insecticidal activity.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence having at least 97% identity to the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819, wherein the polypeptide has insecticidal activity.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence of SEQ ID NO: 4, wherein Xaa at position 9 is Gln, Lys or Glu; Xaa at position 14 is Pro or Ala; Xaa at position 16 is Val or Asp; Xaa at position 19 is Met or Leu; Xaa at position 22 is Gly or Ser; Xaa at position 24 is Asp, Asn or Gln; Xaa at position 36 is Leu or Met; Xaa at position 42 is Asp, Asn or Gln; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu, Asp, Asn or Gly; Xaa at position 50 is Ile or Val; Xaa at position 51 is Glu or Gln; Xaa at position 55 is Arg or Lys; Xaa at position 56 is Ser or Thr; Xaa at position 57 is Tyr or Phe; Xaa at position 58 is Thr or Ser; Xaa at position 61 is Arg, Lys or Glu; Xaa at position 73 is Phe or Tyr; Xaa at position 74 is Lys, Glu, Gly, Arg, Met, Leu, His or Asp; Xaa at position 76 is Asp or Gln; Xaa at position 79 is Lys or Glu; Xaa at position 80 is Glu or Ser; Xaa at position 82 is Glu, Ile, Leu, Tyr or Gln; Xaa at position 83 is Glu or Gln; Xaa at position 84 is Tyr or Phe; Xaa at position 86 is Glu or Gln; Xaa at position 87 is Lys or Gln; Xaa at position 88 is Met, Ile or Leu; Xaa at position 90 is Gln or Glu; Xaa at position 94 is Val or Ile; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln, Gly or Ser; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile, Met, Phe, Cys, Val or Asn; Xaa at position 103 is Ala or Gly; Xaa at position 105 is Leu or Ile; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp, Asn, Glu, Cys, Ala or Arg; Xaa at position 112 is Thr or Ser; Xaa at position 113 is Asp, Glu or Met; Xaa at position 117 is Thr or Ser; Xaa at position 121 is Tyr or Phe; Xaa at position 127 is Ala or Thr; Xaa at position 142 is Arg or Glu; Xaa at position 146 is Arg or Gln; Xaa at position 147 is Arg, Glu or Gln; Xaa at position 148 is Asp, Phe, Pro, Val, Glu, His, Trp, Ala, Arg, Leu, Ser, Gln or Gly; Xaa at position 149 is Phe or Val; Xaa at position 150 is Arg, Gln or Glu; Xaa at position 151 is Asp, Ser, Ala, Asn, Trp, Val, Gln, Cys, Met, Leu, Arg or Glu; Xaa at position 153 is Leu or Ile; Xaa at position 154 is Asn or Asp; Xaa at position 155 is Asn or Lys; Xaa at position 159 is Pro or Asp; Xaa at position 162 is Glu, Asp or Gln; Xaa at position 165 is Lys, Glu, Gln, Pro, Thr, Ala, Leu, Gly, Asp, Val, His, Ile, Met, Trp, Phe, Tyr or Arg; Xaa at position 166 is Arg or Gln; Xaa at position 167 is Tyr, Trp or Cys; Xaa at position 170 is Tyr or His; Xaa at position 171 is Tyr or Phe; Xaa at position 172 is Ile, Leu or Val; Xaa at position 173 is Ser or Ala; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 183 is Tyr or Val; Xaa at position 184 is Ser or Thr; Xaa at position 185 is Ala or Ser; Xaa at position 189 is Thr, Lys or Ile; Xaa at position 191 is Lys or Gln; Xaa at position 193 is Asp or Asn; Xaa at position 196 is Gln, Lys, Asn, Asp, Glu, Ala, Ile or Arg; Xaa at position 202 is Ala or Val; Xaa at position 203 is Glu, Thr or His; Xaa at position 204 is Met or Ala; Xaa at position 206 is Tyr or Phe; Xaa at position 207 is Lys or Gln; Xaa at position 209 is Leu or Pro; Xaa at position 210 is Val or Ile; Xaa at position 214 is Lys, Ser or Gln; Xaa at position 216 is Glu, Gln, Phe, Val, Tyr or Arg; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser, Gln, Arg, Trp, Met, Ala, Phe, Ile, Leu, Cys or Asn; Xaa at position 229 is Arg or Glu; Xaa at position 230 is Ser or Glu; Xaa at position 231 is Asn or Ser; Xaa at position 236 is Leu or Pro; Xaa at position 245 is Met or Leu; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln, Lys or Glu; Xaa at position 257 is Gln, Ile, Glu, Cys, Ser, His, Trp or Met; Xaa at position 261 is Gln, Glu or Lys; Xaa at position 264 is Glu or Gln; Xaa at position 268 is Asp or Asn; Xaa at position 276 is Ser or Ala; Xaa at position 278 is Glu, Asn or Gln; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 282 is Pro or Gly; Xaa at position 284 is Trp or Arg; Xaa at position 287 is Ala or Cys; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr, Asp, Phe, Ser, Met, Arg, Trp, Ile, His, Asn, Cys, Gly or Ala; Xaa at position 291 is Glu or Gln; Xaa at position 292 is Arg or Gln; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 294 is Val or Ala; Xaa at position 296 is Leu or Ile; Xaa at position 297 is Glu or Gln; Xaa at position 298 is Asp or Gln; Xaa at position 300 is Phe or Tyr; Xaa at position 302 is Glu or Gln; Xaa at position 303 is Phe or Tyr; Xaa at position 305 is Lys or Gln; Xaa at position 306 is Gln or Lys; Xaa at position 309 is Gln, Lys or Glu; Xaa at position 313 is Lys, Gln or Arg; Xaa at position 316 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 331 is Glu, Asn or Gln; Xaa at position 333 is Ser, Arg, Gly, Lys, Val, Asn, Ala, His, Gln, Thr, Asp, Ile, Leu, Cys or Glu; Xaa at position 334 is Gly, Arg, Lys, Ile or Trp; Xaa at position 335 is Ser or Ala; Xaa at position 336 is Gly or Ala; Xaa at position 337 is Ala, Val or Gly; Xaa at position 338 is Ser, His, Val, Lys, Ala, Gly, Thr, Ile, Glu, Met, Arg, Pro, Asp, Asn or Leu; Xaa at position 339 is Glu, Asn, Gln, Ile, Pro, Met, Ser, Ala, Cys, Phe, Val, Leu, Asp, Trp, His or Arg; Xaa at position 341 is Leu or Val; Xaa at position 342 is Ala, Ser or Val; Xaa at position 343 is Val or Ile; Xaa at position 344 is Phe or Trp; Xaa at position 345 is Asn or His; Xaa at position 346 is Pro or Ala; Xaa at position 350 is Asn or Ser; Xaa at position 351 is Gly or Val; Xaa at position 354 is Met or Leu; Xaa at position 355 is Val, Ile or Leu; Xaa at position 359 is Gly or Ala; Xaa at position 362 is Asn or Ser; Xaa at position 364 is Ala or Ser; Xaa at position 371 is Ala, Gly or Thr; Xaa at position 374 is Phe or Ile; Xaa at position 375 is Lys or Arg; Xaa at position 380 is Leu or Gly; Xaa at position 382 is Val, Asp or Leu; Xaa at position 383 is Leu, Ile or Val; Xaa at position 384 is Lys, Ala or Gly; Xaa at position 385 is Ala or Gly; Xaa at position 389 is Trp or Tyr; Xaa at position 391 is Arg, Leu, Glu, Gln or Asp; Xaa at position 395 is Asp or Cys; Xaa at position 396 is Ala, Leu, Lys, Asn, Gly, Ile, Met, Arg, Tyr, Gln or His; Xaa at position 397 is Gly, Arg or Ala; Xaa at position 398 is Ser, Gln or Cys; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val, Arg, Ile, Asn, Phe, Thr, Ala, Asp, Met, Gln or Glu; Xaa at position 402 is Lys, Phe, His, Arg, Trp, Gly, Asn, Leu, Tyr, Thr, Val, Met, Pro or Ala; Xaa at position 403 is Asp, Tyr, Trp, Phe or Glu; Xaa at position 405 is Ala or Ser; Xaa at position 409 is Ala or Pro; Xaa at position 410 is Ile or Val; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg, Glu or Gln; Xaa at position 417 is Ala, Ser or Cys; Xaa at position 418 is Leu or Met; Xaa at position 422 is Met or Val; Xaa at position 426 is Thr or Ser; Xaa at position 436 is Asp or Lys; Xaa at position 437 is Tyr or Val; Xaa at position 438 is Val or Arg; Xaa at position 440 is Val or Leu; Xaa at position 442 is Gln, Lys or Glu; Xaa at position 445 is Cys, Leu or Thr; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu, Ile, Gly, Pro, Leu, Phe, Trp or Thr; Xaa at position 448 is Val or Ala; Xaa at position 449 is Gln or Glu; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 453 is Asn or Asp; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp, Gln, Gly, His, Asp, Leu, Thr, Pro or Asn; Xaa at position 455 is Val or Ile; Xaa at position 457 is Trp or Asn; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile, Thr, Ser, His, Cys, Tyr, Pro, Asn, Ala, Arg or Glu; Xaa at position 460 is Gly or Ala; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala, Ser or Gly; Xaa at position 464 is Arg, Gly, His, Gln, Thr or Phe; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly, Arg, Thr, His, Cys, Trp, Phe or Leu; Xaa at position 466 is Asp or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 497 is Asp or Gln; Xaa at position 499 is Glu or Gln; Xaa at position 500 is Arg, Gln or Lys; Xaa at position 502 is Arg, Glu or Gln; Xaa at position 509 is Lys, Gln or Glu; Xaa at position 517 is Gln, Cys, Asn, Val or Pro; Xaa at position 518 is Glu or Gln; Xaa at position 520 is Lys, Gln or Glu; Xaa at position 525 is Gln or Lys; and Xaa at position 527 is Gln, Lys, Pro, Cys, Glu, Ser, His, Phe or Trp; and having one or more amino acid substitutions at positions designated as Xaa in SEQ ID NO: 4 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 2; and amino acid substitutions, deletions, insertions, and fragments thereof, and combinations thereof.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence of SEQ ID NO: 4 having 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60 or 61 amino acid substitutions, in any combination, at residues designated by Xaa in SEQ ID NO: 4 compared to the native amino acid at the corresponding position of SEQ ID NO: 2.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence of SEQ ID NO: 4 having 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 amino acid substitutions, in any combination, at residues designated by Xaa in SEQ ID NO: 4 compared to the native amino acid at the corresponding position of SEQ ID NO: 2.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater sequence identity to the amino acid sequence of SEQ ID NO: 4.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence of SEQ ID NO: 3, wherein Xaa at position 24 is Asp or Asn; Xaa at position 42 is Asp or Asn; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu or Asn; Xaa at position 74 is Lys, Glu or Gly; Xaa at position 79 is Lys or Glu; Xaa at position 82 is Glu, Ile, Leu or Tyr; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln or Gly; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile or Met; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp or Asn; Xaa at position 147 is Arg or Glu; Xaa at position 148 is Asp, Phe or Pro; Xaa at position 150 is Arg or Gln; Xaa at position 151 is Asp, Ser, Ala or Asn; Xaa at position 153 is Leu or Ile; Xaa at position 162 is Glu or Gln; Xaa at position 165 is Lys, Glu or Gln; Xaa at position 166 is Arg or Gln; Xaa at position 171 is Tyr or Phe; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 196 is Gln, Lys, Asn or Asp; Xaa at position 203 is Glu, Thr or His; Xaa at position 206 is Tyr or Phe; Xaa at position 216 is Glu or Gln; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser or Gln; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln or Lys; Xaa at position 257 is Gln or Ile; Xaa at position 261 is Gln or Glu; Xaa at position 278 is Glu or Asn; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr or Asp; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 313 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 333 is Ser, Gly, Lys, Val or Asn; Xaa at position 334 is Gly, Arg, Lys or Ile; Xaa at position 336 is Gly or Ala; Xaa at position 338 is Ser, His, Val, Lys or Ala; Xaa at position 339 is Glu, Asn, Ile or Pro; Xaa at position 343 is Val or Ile; Xaa at position 346 is Pro or Ala; Xaa at position 355 is Val or Ile; Xaa at position 359 is Gly or Ala; Xaa at position 391 is Arg, Glu or Gln; Xaa at position 396 is Ala, Leu, Lys, Asn or Gly; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val or Arg; Xaa at position 402 is Lys, Phe, His, Arg, Gly, Trp, Thr, Asn, Tyr or Met; Xaa at position 403 is Asp or Tyr; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg or Glu; Xaa at position 417 is Ala or Ser; Xaa at position 418 is Leu or Met; Xaa at position 426 is Thr or Ser; Xaa at position 440 is Val or Leu; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu or Ile; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp or Gln; Xaa at position 455 is Val or Ile; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile or Tyr; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala or Ser; Xaa at position 464 is Arg, Gly or His; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 500 is Arg or Gln; Xaa at position 509 is Lys or Gln; Xaa at position 520 is Lys, Gln or Glu; and Xaa at position 527 is Gln, Lys, Pro, Cys or Glu; and having one or more amino acid substitutions at positions designated as Xaa in SEQ ID NO: 3 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO: 35; and amino acid substitutions, deletions, insertions, and fragments thereof, and combinations thereof.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence of SEQ ID NO: 3 having 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75 or 76 amino acid substitutions, in any combination, at residues designated by Xaa in SEQ ID NO: 3 compared to the native amino acid at the corresponding position of SEQ ID NO: 2.

In some embodiments PHI-4 polypeptide comprises an amino acid sequence of SEQ ID NO: 3 having 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53 or 54 amino acid substitutions, in any combination, at residues designated by Xaa in SEQ ID NO: 3 compared to the native amino acid at the corresponding position of SEQ ID NO: 2.

In some embodiments PHI-4 polypeptide comprises an amino acid sequence of SEQ ID NO: 3 having 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 11, 12, 13, 14 or 15 amino acid substitutions, in any combination, at residues designated by Xaa in SEQ ID NO: 3 compared to the native amino acid at the corresponding position of SEQ ID NO: 2.

In some embodiments a PHI-4 polypeptide comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater sequence identity to the amino acid sequence of SEQ ID NO: 3.

In some embodiments a PHI-4 polypeptide comprises one or more amino acid substitutions compared to the native amino acid at position 40, 42, 43, 46, 52, 97, 98, 99, 145, 150, 151, 153, 163, 171, 172, 182, 196, 206, 210, 216, 220, 278, 283, 289, 293, 328, 333, 334, 336, 338, 339, 342, 346, 354, 355, 370, 389, 393, 396, 401, 402, 403, 410, 412, 416, 417, 426, 442, 447, 452, 454, 455, 457, 461, 462, 500, 509, 520 or 527 of SEQ ID NO: 35.

In some embodiments a PHI-4 polypeptide comprises one or more amino acid substitutions compared to the native amino acid at position 40, 42, 43, 46, 52, 97, 98, 99, 145, 150, 151, 153, 163, 171, 172, 182, 196, 206, 210, 216, 220, 278, 283, 289, 293, 328, 333, 334, 336, 338, 339, 342, 346, 354, 355, 370, 389, 393, 396, 401, 402, 403, 410, 412, 416, 417, 426, 442, 447, 452, 454, 455, 457, 461, 462, 500, 509, 520 or 527 of SEQ ID NO: 35 wherein the amino acid at position 40 is Leu or Ile; the amino acid at position 42 is Asp or Asn; the amino acid at position 43 is Phe or Glu; the amino acid at position 46 is Glu or Asn; the amino acid at position 52 is Ile or Val; the amino acid at position 97 is Arg, Asp, Glu or Asn; the amino acid at position 98 is Tyr or Phe; the amino acid at position 99 is Lys or Leu; the amino acid at position 145 is Leu or Val; the amino acid at position 150 is Arg or Gln; the amino acid at position 151 is Asp or Ser; the amino acid at position 153 is Leu or Ile; the amino acid at position 163 is Leu or Val; the amino acid at position 171 is Tyr or Phe; the amino acid at position 172 is Ile or Leu; the amino acid at position 182 is Asp or Gln; the amino acid at position 196 is Gln or Asn; the amino acid at position 206 is Tyr or Phe; the amino acid at position 210 is Val or Ile; the amino acid at position 216 is Glu or Gln; the amino acid at position 220 is Glu, Gln, His or Asp; the amino acid at position 278 is Glu or Asn; the amino acid at position 283 is Ile or Val; the amino acid at position 289 is Lys, Gln or Leu; the amino acid at position 293 is Arg, Gln or Glu; the amino acid at position 328 is Lys or Glu; the amino acid at position 333 is Ser, Lys or Val; the amino acid at position 334 is Gly, Lys or Arg; the amino acid at position 336 is Gly or Ala; the amino acid at position 338 is Ser or Val; the amino acid at position 339 is Glu, Asn or Gln; the amino acid at position 342 is Ala or Ser; the amino acid at position 346 is Pro or Ala; the amino acid at position 354 is Met or Leu; the amino acid at position 355 is Val or Ile; the amino acid at position 370 is His or Arg; the amino acid at position 389 is Trp or Leu; the amino acid at position 393 is Trp or Leu; the amino acid at position 396 is Ala, Leu, Lys, Thr or Gly; the amino acid at position 401 is Ser, His, Gly, Lys or Pro; the amino acid at position 402 is Lys, His, Gly or Trp; the amino acid at position 403 is Asp or Tyr; the amino acid at position 410 is Ile or Val; the amino acid at position 412 is Pro or Ala; the amino acid at position 416 is Arg or Glu; the amino acid at position 417 is Ala or Ser; the amino acid at position 426 is Thr or Ser; the amino acid at position 442 is Gln or Glu; the amino acid at position 447 is Asp or Lys; the amino acid at position 452 is Gln or Lys; the amino acid at position 454 is Arg or Gln; the amino acid at position 455 is Val or Ile; the amino acid at position 457 is Trp or Asn; the amino acid at position 461 is Thr or Ser; the amino acid at position 462 is Gly or Ala; the amino acid at position 500 is Arg or Gln; the amino acid at position 509 is Lys or Gln; the amino acid at position 520 is Lys, Glu or Gln; and the amino acid at position 527 is Gln or Lys., and amino acid deletions, insertions and fragments thereof, and combinations thereof.

In some embodiments the PHI-4 polypeptide comprising one or more amino acid substitutions at position 86, 359, 399, 464, 465, 466, 467, 468, 499 or 517.

In some embodiments the PHI-4 polypeptide comprising one or more amino acid substitutions at position 86, 359, 399, 464, 465, 466, 467, 468, 499 or 517, wherein the amino acid at position 86 is Glu or Thr; the amino acid at position 359 is Gly or Ala; the amino acid at position 399 is Gly or Ala; the amino acid at position 464 is Arg, Ala, Lys, Asp or Asn; the amino acid at position 465 is Lys or Met, the amino acid at position 467 is Val, Ala, Leu or Thr; the amino acid at position 468 is Ser or Leu; the amino acid at position 499 is Glu or Ala, or the amino acid at position 517 is Glu or Arg.

In some embodiments exemplary PHI-4 polypeptides are encoded by the polynucleotide sequence set forth in SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID NOS: 24-30.

In some embodiments a PHI-4 polypeptide includes variants where an amino acid that is part of a proteolytic cleavage site is changed to another amino acid to eliminate or alter the proteolytic cleavage at that site. In some embodiments the proteolytic cleavage is by a protease in the insect gut. In other embodiments the proteolytic cleavage is by a plant protease in the transgenic plant.

In some embodiments exemplary PHI-4 polypeptides are the polypeptides shown in Table 3, Table 4, Table 5, Table 6, Table 7, Megatable 1, and Megatable 2 and combinations of the amino substitutions thereof as well as amino acid deletions, and or insertions and fragments thereof.

In some embodiments a PHI-4 polypeptide is encoded by a nucleic acid molecule that hybridizes under stringent conditions to the nucleic acid molecule of SEQ ID NO: 1, SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID NOS: 24-30. Variants include polypeptides that differ in amino acid sequence due to mutagenesis. Variant proteins encompassed by the disclosure are biologically active, that is they continue to possess the desired biological activity (i.e. pesticidal activity) of the native protein. By “retains activity” is intended that the variant will have at least about 30%, at least about 50%, at least about 70% or at least about 80% of the insecticidal activity of the native protein. In some embodiments, the variants may have improved activity over the native protein.

Bacterial genes quite often possess multiple methionine initiation codons in proximity to the start of the open reading frame. Often, translation initiation at one or more of these start codons will lead to generation of a functional protein. These start codons can include ATG codons. However, bacteria such as Bacillus sp. also recognize the codon GTG as a start codon, and proteins that initiate translation at GTG codons contain a methionine at the first amino acid. On rare occasions, translation in bacterial systems can initiate at a TTG codon, though in this event the TTG encodes a methionine. Furthermore, it is not often determined a priori which of these codons are used naturally in the bacterium. Thus, it is understood that use of one of the alternate methionine codons may also lead to generation of pesticidal proteins. These pesticidal proteins are encompassed in the present disclosure and may be used in the methods of the present disclosure. It will be understood that, when expressed in plants, it will be necessary to alter the alternate start codon to ATG for proper translation.

In another aspect the PHI-4 polypeptide may be expressed as a precursor protein with an intervening sequence that catalyzes multi-step, post translational protein splicing. Protein splicing involves the excision of an intervening sequence from a polypeptide with the concomitant joining of the flanking sequences to yield a new polypeptide (Chong, et al., (1996) J. Biol. Chem. 271:22159-22168). This intervening sequence or protein splicing element, referred to as inteins, which catalyze their own excision through three coordinated reactions at the N-terminal and C-terminal splice junctions: an acyl rearrangement of the N-terminal cysteine or Serine; a transesterfication reaction between the two termini to form a branched ester or thioester intermediate and peptide bond cleavage coupled to cyclization of the intein C-terminal asparagine to free the intein (Evans, et al., (2000) J. Biol. Chem. 275:9091-9094. The elucidation of the mechanism of protein splicing has led to a number of intein-based applications (Comb, et al., U.S. Pat. No. 5,496,714; Comb, et al., U.S. Pat. No. 5,834,247; Camarero and Muir, (1999) J. Amer. Chem. Soc. 121:5597-5598; Chong, et al., (1997) Gene 192:271-281, Chong, et al., (1998) Nucleic Acids Res. 26:5109-5115; Chong, et al., (1998) J. Biol. Chem. 273:10567-10577; Cotton, et al., (1999) J. Am. Chem. Soc. 121:1100-1101; Evans, et al., (1999) J. Biol. Chem. 274:18359-18363; Evans, et al., (1999) J. Biol. Chem. 274:3923-3926; Evans, et al., (1998) Protein Sci. 7:2256-2264; Evans, et al., (2000) J. Biol. Chem. 275:9091-9094; Iwai and Pluckthun, (1999) FEBS Lett. 459:166-172; Mathys, et al., (1999) Gene 231:1-13; Mills, et al., (1998) Proc. Natl. Acad. Sci. USA 95:3543-3548; Muir, et al., (1998) Proc. Natl. Acad. Sci. USA 95:6705-6710; Otomo, et al., (1999) Biochemistry 38:16040-16044; Otomo, et al., (1999) J. Biolmol. NMR 14:105-114; Scott, et al., (1999) Proc. Natl. Acad. Sci. USA 96:13638-13643; Severinov and Muir, (1998) J. Biol. Chem. 273:16205-16209; Shingledecker, et al., (1998) Gene 207:187-195; Southworth, et al., (1998) EMBO J. 17:918-926; Southworth, et al., (1999) Biotechniques 27:110-120; Wood, et al., (1999) Nat. Biotechnol. 17:889-892; Wu, et al., (1998a) Proc. Natl. Acad. Sci. USA 95:9226-9231; Wu, et al., (1998b) Biochim Biophys Acta 1387:422-432; Xu, et al., (1999) Proc. Natl. Acad. Sci. USA 96:388-393; Yamazaki, et al., (1998) J. Am. Chem. Soc. 120:5591-5592). For the application of inteins in plant transgenes see Yang, J, et al., (Transgene Res 15:583-593 (2006)) and Evans, et al., (Annu. Rev. Plant Biol. 56:375-392, (2005)).

In another aspect the PHI-4 polypeptide may be encoded by two separate genes where the intein of the precursor protein comes from the two genes, referred to as a split-intein and the two portions of the precursor are joined by a peptide bond formation. This peptide bond formation is accomplished by intein-mediated trans-splicing. For this purpose, a first and a second expression cassette comprising the two separate genes further code for inteins capable of mediating protein trans-splicing. By trans-splicing, the proteins and polypeptides encoded by the first and second fragments may be linked by peptide bond formation. Trans-splicing inteins may be selected from the nucleolar and organellar genomes of different organisms including eukaryotes, archaebacteria and eubacteria. Inteins that may be used for are listed at neb.com/neb/inteins.html, which can be accessed on the world-wide web using the “www” prefix). The nucleotide sequence coding for an intein may be split into a 5′ and a 3′ part that code for the 5′ and the 3′ part of the intein, respectively. Sequence portions not necessary for intein splicing (e.g., homing endonuclease domain) may be deleted. The intein coding sequence is split such that the 5′ and the 3′ parts are capable of trans-splicing. For selecting a suitable splitting site of the intein coding sequence, the considerations published by Southworth, et al., (1998) EMBO J. 17:918-926 may be followed. In constructing the first and the second expression cassette, the 5′ intein coding sequence is linked to the 3′ end of the first fragment coding for the N-terminal part of the PHI-4 polypeptide and the 3′ intein coding sequence is linked to the 5′ end of the second fragment coding for the C-terminal part of the PHI-4 polypeptide.

In general, the trans-splicing partners can be designed using any split intein, including any naturally-occurring or artificially-split split intein. Several naturally-occurring split inteins are known, for example: the split intein of the DnaE gene of Synechocystis sp. PCC6803 (see, Wu, et al., (1998) Proc Natl Acad Sci USA 95(16):9226-31 and Evans, et al., (2000) J Biol Chem 275(13):9091-4 and of the DnaE gene from Nostoc punctiforme (see, Iwai, et al., (2006) FEBS Lett 580(7):1853-8). Non-split inteins have been artificially split in the laboratory to create new split inteins, for example: the artificially split Ssp DnaB intein (see, Wu, et al., (1998) Biochim Biophys Acta 1387:422-32) and split Sce VMA intein (see, Brenzel, et al., (2006) Biochemistry 45(6):1571-8) and an artificially split fungal mini-intein (see, Elleuche, et al., (2007) Biochem Biophys Res Commun 355(3):830-4). There are also intein databases available that catalogue known inteins (see, for example the online-database available at: bioinformatics.weizmann.ac.il/pietro/inteins/Inteinstable.html, which can be accessed on the world-wide web using the “www” prefix).

Naturally-occurring non-split inteins may have endonuclease or other enzymatic activities that can typically be removed when designing an artificially-split split intein. Such mini-inteins or minimized split inteins are well known in the art and are typically less than 200 amino acid residues long (see, Wu, et al., (1998) Biochim Biophys Acta 1387:422-32). Suitable split inteins may have other purification enabling polypeptide elements added to their structure, provided that such elements do not inhibit the splicing of the split intein or are added in a manner that allows them to be removed prior to splicing. Protein splicing has been reported using proteins that comprise bacterial intein-like (BIL) domains (see, Amitai, et al., (2003) Mol Microbiol 47:61-73) and hedgehog (Hog) auto-processing domains (the latter is combined with inteins when referred to as the Hog/intein superfamily or HINT family (see, Dassa, et al., (2004) J Biol Chem. 279 32001-7) and domains such as these may also be used to prepare artificially-split inteins. In particular, non-splicing members of such families may be modified by molecular biology methodologies to introduce or restore splicing activity in such related species. Recent studies demonstrate that splicing can be observed when a N-terminal split intein component is allowed to react with a C-terminal split intein component not found in nature to be its “partner”; for example, splicing has been observed utilizing partners that have as little as 30 to 50% homology with the “natural” splicing partner (see, Dassa, et al., (2007) Biochemistry 46(1):322-30). Other such mixtures of disparate split intein partners have been shown to be unreactive one with another (see, Brenzel, et al., 2006 Biochemistry 45(6):1571-8). However, it is within the ability of a person skilled in the relevant art to determine whether a particular pair of polypeptides is able to associate with each other to provide a functional intein, using routine methods and without the exercise of inventive skill.

In another aspect the PHI-4 polypeptide is a circular permuted variant. In certain embodiments the PHI-4 polypeptide is a circular permuted variant of the polypeptide of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819. The development of recombinant DNA methods has made it possible to study the effects of sequence transposition on protein folding, structure and function. The approach used in creating new sequences resembles that of naturally occurring pairs of proteins that are related by linear reorganization of their amino acid sequences (Cunningham, et al., (1979) Proc. Natl. Acad. Sci. U.S.A. 76:3218-3222; Teather and Erfle, (1990) J. Bacteriol. 172:3837-3841; Schimming, et al., (1992) Eur. J. Biochem. 204:13-19; Yamiuchi and Minamikawa, (1991) FEBS Lett. 260:127-130; MacGregor, et al., (1996) FEBS Lett. 378:263-266). The first in vitro application of this type of rearrangement to proteins was described by Goldenberg and Creighton (J. Mol. Biol. 165:407-413, 1983). In creating a circular permuted variant a new N-terminus is selected at an internal site (breakpoint) of the original sequence, the new sequence having the same order of amino acids as the original from the breakpoint until it reaches an amino acid that is at or near the original C-terminus. At this point the new sequence is joined, either directly or through an additional portion of sequence (linker), to an amino acid that is at or near the original N-terminus and the new sequence continues with the same sequence as the original until it reaches a point that is at or near the amino acid that was N-terminal to the breakpoint site of the original sequence, this residue forming the new C-terminus of the chain. The length of the amino acid sequence of the linker can be selected empirically or with guidance from structural information, or by using a combination of the two approaches. When no structural information is available, a small series of linkers can be prepared for testing using a design whose length is varied in order to span a range from 0 to 50 Å and whose sequence is chosen in order to be consistent with surface exposure (hydrophilicity, Hopp and Woods, (1983) Mol. Immunol. 20:483-489; Kyte and Doolittle, (1982) J. Mol. Biol. 157:105-132; solvent exposed surface area, Lee and Richards, (1971) J. Mol. Biol. 55:379-400) and the ability to adopt the necessary conformation without deranging the configuration of the pesticidal polypeptide (conformationally flexible; Karplus and Schulz, (1985) Naturwissenschaften 72:212-213. Assuming an average of translation of 2.0 to 3.8 Å per residue, this would mean the length to test would be between 0 to 30 residues, with 0 to 15 residues being the preferred range. Exemplary of such an empirical series would be to construct linkers using a cassette sequence such as Gly-Gly-Gly-Ser repeated n times, where n is 1, 2, 3 or 4. Those skilled in the art will recognize that there are many such sequences that vary in length or composition that can serve as linkers with the primary consideration being that they be neither excessively long nor short (cf., Sandhu, (1992) Critical Rev. Biotech. 12:437-462); if they are too long, entropy effects will likely destabilize the three-dimensional fold, and may also make folding kinetically impractical, and if they are too short, they will likely destabilize the molecule because of torsional or steric strain. Those skilled in the analysis of protein structural information will recognize that using the distance between the chain ends, defined as the distance between the c-alpha carbons, can be used to define the length of the sequence to be used, or at least to limit the number of possibilities that must be tested in an empirical selection of linkers. They will also recognize that it is sometimes the case that the positions of the ends of the polypeptide chain are ill-defined in structural models derived from x-ray diffraction or nuclear magnetic resonance spectroscopy data, and that when true, this situation will therefore need to be taken into account in order to properly estimate the length of the linker required. From those residues whose positions are well defined are selected two residues that are close in sequence to the chain ends, and the distance between their c-alpha carbons is used to calculate an approximate length for a linker between them. Using the calculated length as a guide, linkers with a range of number of residues (calculated using 2 to 3.8 Å per residue) are then selected. These linkers may be composed of the original sequence, shortened or lengthened as necessary, and when lengthened the additional residues may be chosen to be flexible and hydrophilic as described above; or optionally the original sequence may be substituted for using a series of linkers, one example being the Gly-Gly-Gly-Ser cassette approach mentioned above; or optionally a combination of the original sequence and new sequence having the appropriate total length may be used. Sequences of pesticidal polypeptides capable of folding to biologically active states can be prepared by appropriate selection of the beginning (amino terminus) and ending (carboxyl terminus) positions from within the original polypeptide chain while using the linker sequence as described above. Amino and carboxyl termini are selected from within a common stretch of sequence, referred to as a breakpoint region, using the guidelines described below. A novel amino acid sequence is thus generated by selecting amino and carboxyl termini from within the same breakpoint region. In many cases the selection of the new termini will be such that the original position of the carboxyl terminus immediately preceded that of the amino terminus. However, those skilled in the art will recognize that selections of termini anywhere within the region may function, and that these will effectively lead to either amino acid deletions or additions to the amino or carboxyl portions of the new sequence. It is a central tenet of molecular biology that the primary amino acid sequence of a protein dictates folding to the three-dimensional structure necessary for expression of its biological function. Methods are known to those skilled in the art to obtain and interpret three-dimensional structural information using x-ray diffraction of single protein crystals or nuclear magnetic resonance spectroscopy of protein solutions. Examples of structural information that are relevant to the identification of breakpoint regions include the location and type of protein secondary structure (alpha and 3-10 helices, parallel and anti-parallel beta sheets, chain reversals and turns, and loops; Kabsch and Sander, (1983) Biopolymers 22:2577-2637; the degree of solvent exposure of amino acid residues, the extent and type of interactions of residues with one another (Chothia, (1984) Ann. Rev. Biochem. 53:537-572) and the static and dynamic distribution of conformations along the polypeptide chain (Alber and Mathews, (1987) Methods Enzymol. 154:511-533). In some cases additional information is known about solvent exposure of residues; one example is a site of post-translational attachment of carbohydrate which is necessarily on the surface of the protein. When experimental structural information is not available, or is not feasible to obtain, methods are also available to analyze the primary amino acid sequence in order to make predictions of protein tertiary and secondary structure, solvent accessibility and the occurrence of turns and loops. Biochemical methods are also sometimes applicable for empirically determining surface exposure when direct structural methods are not feasible; for example, using the identification of sites of chain scission following limited proteolysis in order to infer surface exposure (Gentile and Salvatore, (1993) Eur. J. Biochem. 218:603-621). Thus using either the experimentally derived structural information or predictive methods (e.g., Srinivisan and Rose, (1995) Proteins: Struct., Funct. & Genetics 22:81-99) the parental amino acid sequence is inspected to classify regions according to whether or not they are integral to the maintenance of secondary and tertiary structure. The occurrence of sequences within regions that are known to be involved in periodic secondary structure (alpha and 3-10 helices, parallel and anti-parallel beta sheets) are regions that should be avoided. Similarly, regions of amino acid sequence that are observed or predicted to have a low degree of solvent exposure are more likely to be part of the so-called hydrophobic core of the protein and should also be avoided for selection of amino and carboxyl termini. In contrast, those regions that are known or predicted to be in surface turns or loops, and especially those regions that are known not to be required for biological activity, are the preferred sites for location of the extremes of the polypeptide chain. Continuous stretches of amino acid sequence that are preferred based on the above criteria are referred to as a breakpoint region. Polynucleotides encoding circular permuted PHI-4 polypeptides with new N-terminus/C-terminus which contain a linker region separating the original C-terminus and N-terminus can be made essentially following the method described in Mullins, et al., (1994) J. Am. Chem. Soc. 116:5529-5533. Multiple steps of polymerase chain reaction (PCR) amplifications are used to rearrange the DNA sequence encoding the primary amino acid sequence of the protein. Polynucleotides encoding circular permuted PHI-4 polypeptides with new N-terminus/C-terminus which contain a linker region separating the original C-terminus and N-terminus can be made based on the tandem-duplication method described in Horlick, et al., (1992) Protein Eng. 5:427-431. Polymerase chain reaction (PCR) amplification of the new N-terminus/C-terminus genes is performed using a tandemly duplicated template DNA.

In another aspect fusion proteins are provided that include within its amino acid sequence an amino acid sequence comprising a PHI-4 polypeptide including but not limited to the polypeptide of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819 and active fragments thereof.

In another aspect fusion proteins are provided comprising a PHI-4 polypeptide and a second pesticidal polypeptide such a Cry protein. Methods for design and construction of fusion proteins (and polynucleotides encoding same) are known to those of skill in the art. Polynucleotides encoding a PHI-4 polypeptide may be fused to signal sequences which will direct the localization of the PHI-4 polypeptide to particular compartments of a prokaryotic or eukaryotic cell and/or direct the secretion of the PHI-4 polypeptide of the embodiments from a prokaryotic or eukaryotic cell. For example, in E. coli, one may wish to direct the expression of the protein to the periplasmic space. Examples of signal sequences or proteins (or fragments thereof) to which the PHI-4 polypeptide may be fused in order to direct the expression of the polypeptide to the periplasmic space of bacteria include, but are not limited to, the pelB signal sequence, the maltose binding protein (MBP) signal sequence, MBP, the ompA signal sequence, the signal sequence of the periplasmic E. coli heat-labile enterotoxin B-subunit, and the signal sequence of alkaline phosphatase. Several vectors are commercially available for the construction of fusion proteins which will direct the localization of a protein, such as the pMAL series of vectors (particularly the pMAL-p series) available from New England Biolabs. In a specific embodiment, the PHI-4 polypeptide may be fused to the pelB pectate lyase signal sequence to increase the efficiency of expression and purification of such polypeptides in Gram-negative bacteria (see, U.S. Pat. Nos. 5,576,195 and 5,846,818). Plant plastid transit peptide/polypeptide fusions are well known in the art (see, U.S. Pat. No. 7,193,133). Apoplast transit peptides such as rice or barley alpha-amylase secretion signal are also well known in the art. The plastid transit peptide is generally fused N-terminal to the polypeptide to be targeted (e.g., the fusion partner). In one embodiment, the fusion protein consists essentially of the peptide transit plastid and the PHI-4 polypeptide to be targeted. In another embodiment, the fusion protein comprises the peptide transit plastid and the polypeptide to be targeted. In such embodiments, the plastid transit peptide is preferably at the N-terminus of the fusion protein. However, additional amino acid residues may be N-terminal to the plastid transit peptide providing that the fusion protein is at least partially targeted to a plastid. In a specific embodiment, the plastid transit peptide is in the N-terminal half, N-terminal third or N-terminal quarter of the fusion protein. Most or all of the plastid transit peptide is generally cleaved from the fusion protein upon insertion into the plastid. The position of cleavage may vary slightly between plant species, at different plant developmental stages, as a result of specific intercellular conditions or the particular combination of transit peptide/fusion partner used. In one embodiment, the plastid transit peptide cleavage is homogenous such that the cleavage site is identical in a population of fusion proteins. In another embodiment, the plastid transit peptide is not homogenous, such that the cleavage site varies by 1-10 amino acids in a population of fusion proteins. The plastid transit peptide can be recombinantly fused to a second protein in one of several ways. For example, a restriction endonuclease recognition site can be introduced into the nucleotide sequence of the transit peptide at a position corresponding to its C-terminal end and the same or a compatible site can be engineered into the nucleotide sequence of the protein to be targeted at its N-terminal end. Care must be taken in designing these sites to ensure that the coding sequences of the transit peptide and the second protein are kept “in frame” to allow the synthesis of the desired fusion protein. In some cases, it may be preferable to remove the initiator methionine codon of the second protein when the new restriction site is introduced. The introduction of restriction endonuclease recognition sites on both parent molecules and their subsequent joining through recombinant DNA techniques may result in the addition of one or more extra amino acids between the transit peptide and the second protein. This generally does not affect targeting activity as long as the transit peptide cleavage site remains accessible and the function of the second protein is not altered by the addition of these extra amino acids at its N-terminus. Alternatively, one skilled in the art can create a precise cleavage site between the transit peptide and the second protein (with or without its initiator methionine) using gene synthesis (Stemmer, et al., (1995) Gene 164:49-53) or similar methods. In addition, the transit peptide fusion can intentionally include amino acids downstream of the cleavage site. The amino acids at the N-terminus of the mature protein can affect the ability of the transit peptide to target proteins to plastids and/or the efficiency of cleavage following protein import. This may be dependent on the protein to be targeted. See, e.g., Comai, et al., (1988) J. Biol. Chem. 263(29):15104-9.

In some embodiments fusion proteins are provide comprising a PHI-4 polypeptide, a pesticidal protein such as a cry protein, and an amino acid linker.

In some embodiments fusion proteins are provided represented by a formula selected from the group consisting of

R¹-L-R²,R²-L-R¹,R¹-R² or R²-R¹

where R¹ is a PHI-4 polypeptide, R² is a pesticidal protein with a different but complementary activity to the PHI-4 polypeptide, including but not limited to cry proteins; a polypeptide that increases the solubility and/or stability of the PHI-4 polypeptide; or a transit peptide or leader sequence. The R¹ polypeptide is fused either directly or through a linker segment to the R² polypeptide. The term “directly” defines fusions in which the polypeptides are joined without a peptide linker. Thus L represents a chemical bound or polypeptide segment to which both R¹ and R² are fused in frame, most commonly L is a linear peptide to which R¹ and R² are bound by amide bonds linking the carboxy terminus of R¹ to the amino terminus of L and carboxy terminus of L to the amino terminus of R². By “fused in frame” is meant that there is no translation termination or disruption between the reading frames of R¹ and R². The linking group (L) is generally a polypeptide of between 1 and 500 amino acids in length. The linkers joining the two molecules are preferably designed to (1) allow the two molecules to fold and act independently of each other, (2) not have a propensity for developing an ordered secondary structure which could interfere with the functional domains of the two proteins, (3) have minimal hydrophobic or charged characteristic which could interact with the functional protein domains and (4) provide steric separation of R¹ and R² such that R¹ and R² could interact simultaneously with their corresponding receptors on a single cell. Typically surface amino acids in flexible protein regions include Gly, Asn and Ser. Virtually any permutation of amino acid sequences containing Gly, Asn and Ser would be expected to satisfy the above criteria for a linker sequence. Other neutral amino acids, such as Thr and Ala, may also be used in the linker sequence. Additional amino acids may also be included in the linkers due to the addition of unique restriction sites in the linker sequence to facilitate construction of the fusions.

In some embodiments the linkers comprise sequences selected from the group of formulas: (Gly₃Ser)_(n), (Gly₄Ser)_(n), (Gly₅Ser)_(n), (Gly_(n)Ser)_(n) or (AlaGlySer)_(n) where n is an integer. One example of a highly-flexible linker is the (GlySer)-rich spacer region present within the pill protein of the filamentous bacteriophages, e.g., bacteriophages M13 or fd (Schaller, et al., 1975). This region provides a long, flexible spacer region between two domains of the pill surface protein. Also included are linkers in which an endopeptidase recognition sequence is included. Such a cleavage site may be valuable to separate the individual components of the fusion to determine if they are properly folded and active in vitro. Examples of various endopeptidases include, but are not limited to, Plasmin, Enterokinase, Kallikerin, Urokinase, Tissue Plasminogen activator, clostripain, Chymosin, Collagenase, Russell's Viper Venom Protease, Postproline cleavage enzyme, V8 protease, Thrombin and factor Xa. In some embodiments the linker comprises the amino acids EEKKN from the multi-gene expression vehicle (MGEV), which is cleaved by vacuolar proteases as disclosed in US 2007/0277263. In other embodiments, peptide linker segments from the hinge region of heavy chain immunoglobulins IgG, IgA, IgM, IgD or IgE provide an angular relationship between the attached polypeptides. Especially useful are those hinge regions where the cysteines are replaced with serines. Preferred linkers of the present invention include sequences derived from murine IgG gamma 2b hinge region in which the cysteines have been changed to serines. The fusion proteins are not limited by the form, size or number of linker sequences employed and the only requirement of the linker is that functionally it does not interfere adversely with the folding and function of the individual molecules of the fusion.

In another aspect chimeric PHI-4 polypeptide are provided that are created through joining two or more portions of genes, which originally encoded separate insecticidal proteins from different species, to create a chimeric gene. The translation of the chimeric gene results in a single chimeric pesticidal polypeptide with regions, motifs or domains derived from each of the original polypeptides. In certain embodiments the chimeric protein comprises portions, motifs, or domains of PHI-4 polypeptides in any combination. In certain embodiments the chimeric insecticidal polypeptide includes but not limited to the polypeptides of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819.

It is recognized that DNA sequences may be altered by various methods, and that these alterations may result in DNA sequences encoding proteins with amino acid sequences different than that encoded by the wild-type (or native) pesticidal protein. These proteins may be altered in various ways including amino acid substitutions, amino acid deletions, amino acid truncations, and insertions of one or more amino acids, including up to 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155 or more amino acid substitutions, amino acid deletions and/or insertions or combinations thereof compared to SEQ ID NO: 35 SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819. In some embodiments a PHI-4 polypeptide comprises the deletion of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28 or more amino acids from the C-terminus of the PHI-4 polypeptide relative to the amino acid position of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 or SEQ ID NOs: 51-819. Methods for such manipulations are generally known in the art. For example, amino acid sequence variants of a PHI-4 polypeptide can be prepared by mutations in the DNA. This may also be accomplished by one of several forms of mutagenesis and/or in directed evolution. In some aspects, the changes encoded in the amino acid sequence will not substantially affect the function of the protein. Such variants will possess the desired pesticidal activity. However, it is understood that the ability of a PHI-4 polypeptide to confer pesticidal activity may be improved by the use of such techniques upon the compositions of this disclosure.

For example, conservative amino acid substitutions may be made at one or more, predicted, nonessential amino acid residues. A “nonessential” amino acid residue is a residue that can be altered from the wild-type sequence of a PHI-4 polypeptide without altering the biological activity. A “conservative amino acid substitution” is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include: amino acids with basic side chains (e.g., lysine, arginine, histidine); acidic side chains (e.g., aspartic acid, glutamic acid); polar, negatively charged residues and their amides (e.g., aspartic acid, asparagine, glutamic, acid, glutamine; uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine); small aliphatic, nonpolar or slightly polar residues (e.g., Alanine, serine, threonine, proline, glycine); nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan); large aliphatic, nonpolar residues (e.g., methionine, leucine, isoleucine, valine, cystine); beta-branched side chains (e.g., threonine, valine, isoleucine); aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine); large aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan).

Amino acid substitutions may be made in nonconserved regions that retain function. In general, such substitutions would not be made for conserved amino acid residues, or for amino acid residues residing within a conserved motif, where such residues are essential for protein activity. Examples of residues that are conserved and that may be essential for protein activity include, for example, residues that are identical between all proteins contained in an alignment of similar or related toxins to the sequences of the embodiments (e.g., residues that are identical in an alignment of homologous proteins). Examples of residues that are conserved but that may allow conservative amino acid substitutions and still retain activity include, for example, residues that have only conservative substitutions between all proteins contained in an alignment of similar or related toxins to the sequences of the embodiments (e.g., residues that have only conservative substitutions between all proteins contained in the alignment homologous proteins). However, one of skill in the art would understand that functional variants may have minor conserved or nonconserved alterations in the conserved residues. Guidance as to appropriate amino acid substitutions that do not affect biological activity of the protein of interest may be found in the model of Dayhoff, et al., (1978) Atlas of Protein Sequence and Structure (Natl. Biomed. Res. Found., Washington, D.C.), herein incorporated by reference.

In making such changes, the hydropathic index of amino acids may be considered. The importance of the hydropathic amino acid index in conferring interactive biologic function on a protein is generally understood in the art (Kyte and Doolittle, (1982) J Mol Biol. 157(1):105-32). It is accepted that the relative hydropathic character of the amino acid contributes to the secondary structure of the resultant protein, which in turn defines the interaction of the protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens and the like.

It is known in the art that certain amino acids may be substituted by other amino acids having a similar hydropathic index or score and still result in a protein with similar biological activity, i.e., still obtain a biological functionally equivalent protein. Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics (Kyte and Doolittle, ibid). These are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (−0.4); threonine (−0.7); serine (−0.8); tryptophan (−0.9); tyrosine (−1.3); proline (−1.6); histidine (−3.2); glutamate (−3.5); glutamine (−3.5); aspartate (−3.5); asparagine (−3.5); lysine (−3.9) and arginine (−4.5). In making such changes, the substitution of amino acids whose hydropathic indices are within +2 is preferred, those which are within +1 are particularly preferred and those within +0.5 are even more particularly preferred.

It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity. U.S. Pat. No. 4,554,101, states that the greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein.

As detailed in U.S. Pat. No. 4,554,101, the following hydrophilicity values have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0.+0.1); glutamate (+3.0.+0.1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (−0.4); proline (−0.5.+0.1); alanine (−0.5); histidine (−0.5); cysteine (−1.0); methionine (−1.3); valine (−1.5); leucine (−1.8); isoleucine (−1.8); tyrosine (−2.3); phenylalanine (−2.5); tryptophan (−3.4).

Alternatively, alterations may be made to the protein sequence of many proteins at the amino or carboxy terminus without substantially affecting activity. This can include amino acid insertions, amino acid deletions, or amino acid alterations introduced by modern molecular methods, such as PCR, including PCR amplifications that alter or extend the protein coding sequence by virtue of inclusion of amino acid encoding sequences in the oligonucleotides utilized in the PCR amplification. Alternatively, the protein sequences added can include entire protein-coding sequences, such as those used commonly in the art to generate protein fusions. Such fusion proteins are often used to (1) increase expression of a protein of interest (2) introduce a binding domain, enzymatic activity, or epitope to facilitate either protein purification, protein detection, or other experimental uses known in the art (3) target secretion or translation of a protein to a subcellular organelle, such as the periplasmic space of Gram-negative bacteria, mitochondria or chloroplasts of plants or the endoplasmic reticulum of eukaryotic cells, the latter of which often results in glycosylation of the protein.

In specific embodiments, the substitution is an alanine for the native amino acid at the recited position(s). Also encompassed are the nucleic acid sequence(s) encoding the variant protein or polypeptide.

Variant nucleotide and amino acid sequences of the disclosure also encompass sequences derived from mutagenic and recombinogenic procedures such as DNA shuffling. With such a procedure, one or more different PHI-4 polypeptide coding regions can be used to create a new PHI-4 polypeptide possessing the desired properties. In this manner, libraries of recombinant polynucleotides are generated from a population of related sequence polynucleotides comprising sequence regions that have substantial sequence identity and can be homologously recombined in vitro or in vivo. For example, using this approach, sequence motifs encoding a domain of interest may be shuffled between a pesticidal gene and other known pesticidal genes to obtain a new gene coding for a protein with an improved property of interest, such as an increased insecticidal activity. Strategies for such DNA shuffling are known in the art. See, for example, Stemmer, (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751; Stemmer, (1994) Nature 370:389-391; Crameri, et al., (1997) Nature Biotech. 15:436-438; Moore, et al., (1997) J. Mol. Biol. 272:336-347; Zhang, et al., (1997) Proc. Natl. Acad. Sci. USA 94:4504-4509; Crameri, et al., (1998) Nature 391:288-291 and U.S. Pat. Nos. 5,605,793 and 5,837,458.

Domain swapping or shuffling is another mechanism for generating altered PHI-4 polypeptides. Domains may be swapped between PHI-4 polypeptides, resulting in hybrid or chimeric toxins with improved pesticidal activity or target spectrum. Methods for generating recombinant proteins and testing them for pesticidal activity are well known in the art (see, for example, Naimov, et al., (2001) Appl. Environ. Microbiol. 67:5328-5330; de Maagd, et al., (1996) Appl. Environ. Microbiol. 62:1537-1543; Ge, et al., (1991) J. Biol. Chem. 266:17954-17958; Schnepf, et al., (1990) J. Biol. Chem. 265:20923-20930; Rang, et al., 91999) Appl. Environ. Microbiol. 65:2918-2925).

Antibodies

Antibodies to a PHI-4 polypeptide of the embodiments, or to variants or fragments thereof, are also encompassed. Methods for producing antibodies are well known in the art (see, for example, Harlow and Lane, (1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.; U.S. Pat. No. 4,196,265).

A kit for detecting the presence of a PHI-4 polypeptide, or detecting the presence of a nucleotide sequence encoding a PHI-4 polypeptide, in a sample is provided. In one embodiment, the kit provides antibody-based reagents for detecting the presence of a PHI-4 polypeptide in a tissue sample. In another embodiment, the kit provides labeled nucleic acid probes useful for detecting the presence of one or more polynucleotides encoding PHI-4 polypeptide(s). The kit is provided along with appropriate reagents and controls for carrying out a detection method, as well as instructions for use of the kit.”

Receptor Identification and Isolation

Receptors to the PHI-4 polypeptide of the embodiments, or to variants or fragments thereof, are also encompassed. Methods for identifying receptors are well known in the art (see, Hofmann, et. al., (1988) Eur. J. Biochem. 173:85-91; Gill, et al., (1995) J. Biol. Chem. 27277-27282) can be employed to identify and isolate the receptor that recognizes the PHI-4 polypeptides using the brush-border membrane vesicles from susceptible insects. In addition to the radioactive labeling method listed in the cited literatures, PHI-4 polypeptide can be labeled with fluorescent dye and other common labels such as streptavidin. Brush-border membrane vesicles (BBMV) of susceptible insects such as soybean looper and stink bugs can be prepared according to the protocols listed in the references and separated on SDS-PAGE gel and blotted on suitable membrane. Labeled PHI-4 polypeptides can be incubated with blotted membrane of BBMV and labeled the PHI-4 polypeptides can be identified with the labeled reporters. Identification of protein band(s) that interact with the PHI-4 polypeptides can be detected by N-terminal amino acid gas phase sequencing or mass spectrometry based protein identification method (Patterson, (1998) 10(22):1-24, Current Protocol in Molecular Biology published by John Wiley & Son Inc). Once the protein is identified, the corresponding gene can be cloned from genomic DNA or cDNA library of the susceptible insects and binding affinity can be measured directly with the PHI-4 polypeptides. Receptor function for insecticidal activity by the PHI-4 polypeptides can be verified by accomplished by RNAi type of gene knock out method (Rajagopal, et al., (2002) J. Biol. Chem. 277:46849-46851).

Nucleotide Constructs, Expression Cassettes and Vectors

The use of the term “nucleotide constructs” herein is not intended to limit the embodiments to nucleotide constructs comprising DNA. Those of ordinary skill in the art will recognize that nucleotide constructs particularly polynucleotides and oligonucleotides composed of ribonucleotides and combinations of ribonucleotides and deoxyribonucleotides may also be employed in the methods disclosed herein. The nucleotide constructs, nucleic acids, and nucleotide sequences of the embodiments additionally encompass all complementary forms of such constructs, molecules and sequences. Further, the nucleotide constructs, nucleotide molecules and nucleotide sequences of the embodiments encompass all nucleotide constructs, molecules and sequences which can be employed in the methods of the embodiments for transforming plants including, but not limited to, those comprised of deoxyribonucleotides, ribonucleotides and combinations thereof. Such deoxyribonucleotides and ribonucleotides include both naturally occurring molecules and synthetic analogues. The nucleotide constructs, nucleic acids, and nucleotide sequences of the embodiments also encompass all forms of nucleotide constructs including, but not limited to, single-stranded forms, double-stranded forms, hairpins, stem-and-loop structures and the like.

A further embodiment relates to a transformed organism such as an organism selected from plant and insect cells, bacteria, yeast, baculovirus, protozoa, nematodes and algae. The transformed organism comprises a DNA molecule of the embodiments, an expression cassette comprising the DNA molecule or a vector comprising the expression cassette, which may be stably incorporated into the genome of the transformed organism.

The sequences of the embodiments are provided in DNA constructs for expression in the organism of interest. The construct will include 5′ and 3′ regulatory sequences operably linked to a sequence of the embodiments. The term “operably linked” as used herein refers to a functional linkage between a promoter and a second sequence, wherein the promoter sequence initiates and mediates transcription of the DNA sequence corresponding to the second sequence. Generally, operably linked means that the nucleic acid sequences being linked are contiguous and where necessary to join two protein coding regions in the same reading frame. The construct may additionally contain at least one additional gene to be cotransformed into the organism. Alternatively, the additional gene(s) can be provided on multiple DNA constructs.

Such a DNA construct is provided with a plurality of restriction sites for insertion of the PHI-4 polypeptide gene sequence to be under the transcriptional regulation of the regulatory regions. The DNA construct may additionally contain selectable marker genes.

The DNA construct will generally include in the 5′ to 3′ direction of transcription: a transcriptional and translational initiation region (i.e., a promoter), a DNA sequence of the embodiments and a transcriptional and translational termination region (i.e., termination region) functional in the organism serving as a host. The transcriptional initiation region (i.e., the promoter) may be native, analogous, foreign or heterologous to the host organism and/or to the sequence of the embodiments. Additionally, the promoter may be the natural sequence or alternatively a synthetic sequence. The term “foreign” as used herein indicates that the promoter is not found in the native organism into which the promoter is introduced. Where the promoter is “foreign” or “heterologous” to the sequence of the embodiments, it is intended that the promoter is not the native or naturally occurring promoter for the operably linked sequence of the embodiments. As used herein, a chimeric gene comprises a coding sequence operably linked to a transcription initiation region that is heterologous to the coding sequence. Where the promoter is a native or natural sequence, the expression of the operably linked sequence is altered from the wild-type expression, which results in an alteration in phenotype.

In some embodiments the DNA construct may also include a transcriptional enhancer sequence. As used herein, the term an “enhancer” refers to a DNA sequence which can stimulate promoter activity and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue-specificity of a promoter. Various enhancers are known in the art including for example, introns with gene expression enhancing properties in plants (US Patent Application Publication Number 2009/0144863, the ubiquitin intron (i.e., the maize ubiquitin intron 1 (see, for example, NCBI sequence S94464)), the omega enhancer or the omega prime enhancer (Gallie, et al., (1989) Molecular Biology of RNA ed. Cech (Liss, New York) 237-256 and Gallie, et al., (1987) Gene 60:217-25), the CaMV 35S enhancer (see, e.g., Benfey, et al., (1990) EMBO J. 9:1685-96) and the enhancers of U.S. Pat. No. 7,803,992 may also be used, each of which is incorporated by reference. The above list of transcriptional enhancers is not meant to be limiting. Any appropriate transcriptional enhancer can be used in the embodiments.

The termination region may be native with the transcriptional initiation region, may be native with the operably linked DNA sequence of interest, may be native with the plant host, or may be derived from another source (i.e., foreign or heterologous to the promoter, the sequence of interest, the plant host or any combination thereof).

Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also, Guerineau, et al., (1991) Mol. Gen. Genet. 262:141-144; Proudfoot, (1991) Cell 64:671-674; Sanfacon, et al., (1991) Genes Dev. 5:141-149; Mogen, et al., (1990) Plant Cell 2:1261-1272; Munroe, et al., (1990) Gene 91:151-158; Ballas, et al., (1989) Nucleic Acids Res. 17:7891-7903 and Joshi, et al., (1987) Nucleic Acid Res. 15:9627-9639.

Where appropriate, a nucleic acid may be optimized for increased expression in the host organism. Thus, where the host organism is a plant, the synthetic nucleic acids can be synthesized using plant-preferred codons for improved expression. See, for example, Campbell and Gowri, (1990) Plant Physiol. 92:1-11 for a discussion of host-preferred codon usage. For example, although nucleic acid sequences of the embodiments may be expressed in both monocotyledonous and dicotyledonous plant species, sequences can be modified to account for the specific codon preferences and GC content preferences of monocotyledons or dicotyledons as these preferences have been shown to differ (Murray et al. (1989) Nucleic Acids Res. 17:477-498). Thus, the maize-preferred codon for a particular amino acid may be derived from known gene sequences from maize. Maize codon usage for 28 genes from maize plants is listed in Table 4 of Murray, et al., supra. Methods are available in the art for synthesizing plant-preferred genes. See, for example, U.S. Pat. Nos. 5,380,831, and 5,436,391 and Murray, et al., (1989) Nucleic Acids Res. 17:477-498, herein incorporated by reference.

Additional sequence modifications are known to enhance gene expression in a cellular host. These include elimination of sequences encoding spurious polyadenylation signals, exon-intron splice site signals, transposon-like repeats, and other well-characterized sequences that may be deleterious to gene expression. The GC content of the sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. The term “host cell” as used herein refers to a cell which contains a vector and supports the replication and/or expression of the expression vector is intended. Host cells may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian or mammalian cells or monocotyledonous or dicotyledonous plant cells. An example of a monocotyledonous host cell is a maize host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.

The expression cassettes may additionally contain 5′ leader sequences. Such leader sequences can act to enhance translation. Translation leaders are known in the art and include: picornavirus leaders, for example, EMCV leader (Encephalomyocarditis 5′ noncoding region) (Elroy-Stein, et al., (1989) Proc. Natl. Acad. Sci. USA 86:6126-6130); potyvirus leaders, for example, TEV leader (Tobacco Etch Virus) (Gallie, et al., (1995) Gene 165(2):233-238), MDMV leader (Maize Dwarf Mosaic Virus), human immunoglobulin heavy-chain binding protein (BiP) (Macejak, et al., (1991) Nature 353:90-94); untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4) (Jobling, et al., (1987) Nature 325:622-625); tobacco mosaic virus leader (TMV) (Gallie, et al., (1989) in Molecular Biology of RNA, ed. Cech (Liss, New York), pp. 237-256) and maize chlorotic mottle virus leader (MCMV) (Lommel, et al., (1991) Virology 81:382-385). See also, Della-Cioppa, et al., (1987) Plant Physiol. 84:965-968. Such constructs may also contain a “signal sequence” or “leader sequence” to facilitate co-translational or post-translational transport of the peptide to certain intracellular structures such as the chloroplast (or other plastid), endoplasmic reticulum, or Golgi apparatus.

By “signal sequence” is intended a sequence that is known or suspected to result in cotranslational or post-translational peptide transport across the cell membrane. In eukaryotes, this typically involves secretion into the Golgi apparatus, with some resulting glycosylation. Insecticidal toxins of bacteria are often synthesized as protoxins, which are protolytically activated in the gut of the target pest (Chang, (1987) Methods Enzymol. 153:507-516). In some embodiments, the signal sequence is located in the native sequence, or may be derived from a sequence of the embodiments. By “leader sequence” is intended any sequence that when translated, results in an amino acid sequence sufficient to trigger co-translational transport of the peptide chain to a subcellular organelle. Thus, this includes leader sequences targeting transport and/or glycosylation by passage into the endoplasmic reticulum, passage to vacuoles, plastids including chloroplasts, mitochondria and the like. Nuclear-encoded proteins targeted to the chloroplast thylakoid lumen compartment have a characteristic bipartite transit peptide, composed of a stromal targeting signal peptide and a lumen targeting signal peptide. The stromal targeting information is in the amino-proximal portion of the transit peptide. The lumen targeting signal peptide is in the carboxyl-proximal portion of the transit peptide, and contains all the information for targeting to the lumen. Recent research in proteomics of the higher plant chloroplast has achieved in the identification of numerous nuclear-encoded lumen proteins (Kieselbach et al. FEBS LETT 480:271-276, 2000; Peltier et al. Plant Cell 12:319-341, 2000; Bricker et al. Biochim. Biophys Acta 1503:350-356, 2001), the lumen targeting signal peptide of which can potentially be used in accordance with the present invention. About 80 proteins from Arabidopsis, as well as homologous proteins from spinach and garden pea, are reported by Kieselbach et al., Photosynthesis Research, 78:249-264, 2003. In particular, table 2 of this publication, which is incorporated into the description herewith by reference, discloses 85 proteins from the chloroplast lumen, identified by their accession number (see also US Patent Application Publication 2009/09044298). In addition, the recently published draft version of the rice genome (Goff et al, Science 296:92-100, 2002) is a suitable source for lumen targeting signal peptide which may be used in accordance with the present invention.

Suitable chloroplast transit peptides (CTP) are well known to one skilled in the art also include chimeric CTPs comprising but not limited to, an N-terminal domain, a central domain or a C-terminal domain from a CTP from Oryza sativa 1-deoxy-D xyulose-5-Phosphate Synthase, Oryza sativa-Superoxide dismutase, Oryza sativa-soluble starch synthase, Oryza sativa-NADP-dependent Malic acid enzyme, Oryza sativa-Phospho-2-dehydro-3-deoxyheptonate Aldolase 2, Oryza sativa-L-Ascorbate peroxidase 5, Oryza sativa-Phosphoglucan water dikinase, Zea Mays ssRUBISCO, Zea Mays-beta-glucosidase, Zea Mays-Malate dehydrogenase, Zea Mays Thioredoxin M-type US Patent Application Publication 2012/0304336).

The PHI-4 polypeptide gene to be targeted to the chloroplast may be optimized for expression in the chloroplast to account for differences in codon usage between the plant nucleus and this organelle. In this manner, the nucleic acids of interest may be synthesized using chloroplast-preferred codons. See, for example, U.S. Pat. No. 5,380,831, herein incorporated by reference.

In preparing the expression cassette, the various DNA fragments may be manipulated so as to provide for the DNA sequences in the proper orientation and, as appropriate, in the proper reading frame. Toward this end, adapters or linkers may be employed to join the DNA fragments or other manipulations may be involved to provide for convenient restriction sites, removal of superfluous DNA, removal of restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair, restriction, annealing, resubstitutions, e.g., transitions and transversions, may be involved.

A number of promoters can be used in the practice of the embodiments. The promoters can be selected based on the desired outcome. The nucleic acids can be combined with constitutive, tissue-preferred, inducible, or other promoters for expression in the host organism. Suitable constitutive promoters for use in a plant host cell include, for example, the core promoter of the Rsyn7 promoter and other constitutive promoters disclosed in WO 1999/43838 and U.S. Pat. No. 6,072,050; the core CaMV 35S promoter (Odell, et al., (1985) Nature 313:810-812); rice actin (McElroy, et al., (1990) Plant Cell 2:163-171); ubiquitin (Christensen, et al., (1989) Plant Mol. Biol. 12:619-632 and Christensen, et al., (1992) Plant Mol. Biol. 18:675-689); pEMU (Last, et al., (1991) Theor. Appl. Genet. 81:581-588); MAS (Velten, et al., (1984) EMBO J. 3:2723-2730); ALS promoter (U.S. Pat. No. 5,659,026) and the like. Other constitutive promoters include, for example, those discussed in U.S. Pat. Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; 5,608,142 and 6,177,611.

Depending on the desired outcome, it may be beneficial to express the gene from an inducible promoter. Of particular interest for regulating the expression of the nucleotide sequences of the embodiments in plants are wound-inducible promoters. Such wound-inducible promoters, may respond to damage caused by insect feeding, and include potato proteinase inhibitor (pin II) gene (Ryan, (1990) Ann. Rev. Phytopath. 28:425-449; Duan, et al., (1996) Nature Biotechnology 14:494-498); wun1 and wun2, U.S. Pat. No. 5,428,148; win1 and win2 (Stanford, et al., (1989) Mol. Gen. Genet. 215:200-208); systemin (McGurl, et al., (1992) Science 225:1570-1573); WIP1 (Rohmeier, et al., (1993) Plant Mol. Biol. 22:783-792; Eckelkamp, et al., (1993) FEBS Letters 323:73-76); MPI gene (Corderok, et al., (1994) Plant J. 6(2):141-150) and the like, herein incorporated by reference.

Additionally, pathogen-inducible promoters may be employed in the methods and nucleotide constructs of the embodiments. Such pathogen-inducible promoters include those from pathogenesis-related proteins (PR proteins), which are induced following infection by a pathogen; e.g., PR proteins, SAR proteins, beta-1,3-glucanase, chitinase, etc. See, for example, Redolfi, et al., (1983) Neth. J. Plant Pathol. 89:245-254; Uknes, et al., (1992) Plant Cell 4:645-656 and Van Loon, (1985) Plant Mol. Virol. 4:111-116. See also, WO 1999/43819, herein incorporated by reference.

Of interest are promoters that are expressed locally at or near the site of pathogen infection. See, for example, Marineau, et al., (1987) Plant Mol. Biol. 9:335-342; Matton, et al., (1989) Molecular Plant-Microbe Interactions 2:325-331; Somsisch, et al., (1986) Proc. Natl. Acad. Sci. USA 83:2427-2430; Somsisch, et al., (1988) Mol. Gen. Genet. 2:93-98 and Yang, (1996) Proc. Natl. Acad. Sci. USA 93:14972-14977. See also, Chen, et al., (1996) Plant J. 10:955-966; Zhang, et al., (1994) Proc. Natl. Acad. Sci. USA 91:2507-2511; Warner, et al., (1993) Plant J. 3:191-201; Siebertz, et al., (1989) Plant Cell 1:961-968; U.S. Pat. No. 5,750,386 (nematode-inducible) and the references cited therein. Of particular interest is the inducible promoter for the maize PRms gene, whose expression is induced by the pathogen Fusarium moniliforme (see, for example, Cordero, et al., (1992) Physiol. Mol. Plant Path. 41:189-200).

Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator. Depending upon the objective, the promoter may be a chemical-inducible promoter, where application of the chemical induces gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression. Chemical-inducible promoters are known in the art and include, but are not limited to, the maize In2-2 promoter, which is activated by benzenesulfonamide herbicide safeners, the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides, and the tobacco PR-1a promoter, which is activated by salicylic acid. Other chemical-regulated promoters of interest include steroid-responsive promoters (see, for example, the glucocorticoid-inducible promoter in Schena, et al., (1991) Proc. Natl. Acad. Sci. USA 88:10421-10425 and McNellis, et al., (1998) Plant J. 14(2):247-257) and tetracycline-inducible and tetracycline-repressible promoters (see, for example, Gatz, et al., (1991) Mol. Gen. Genet. 227:229-237 and U.S. Pat. Nos. 5,814,618 and 5,789,156), herein incorporated by reference.

Tissue-preferred promoters can be utilized to target enhanced PHI-4 polypeptide expression within a particular plant tissue. Tissue-preferred promoters include those discussed in Yamamoto, et al., (1997) Plant J. 12(2)255-265; Kawamata, et al., (1997) Plant Cell Physiol. 38(7):792-803; Hansen, et al., (1997) Mol. Gen Genet. 254(3):337-343; Russell, et al., (1997) Transgenic Res. 6(2):157-168; Rinehart, et al., (1996) Plant Physiol. 112(3):1331-1341; Van Camp, et al., (1996) Plant Physiol. 112(2):525-535; Canevascini, et al., (1996) Plant Physiol. 112(2):513-524; Yamamoto, et al., (1994) Plant Cell Physiol. 35(5):773-778; Lam, (1994) Results Probl. Cell Differ. 20:181-196; Orozco, et al., (1993) Plant Mol Biol. 23(6):1129-1138; Matsuoka, et al., (1993) Proc Natl. Acad. Sci. USA 90(20):9586-9590 and Guevara-Garcia, et al., (1993) Plant J. 4(3):495-505. Such promoters can be modified, if necessary, for weak expression.

Leaf-preferred promoters are known in the art. See, for example, Yamamoto, et al., (1997) Plant J. 12(2):255-265; Kwon, et al., (1994) Plant Physiol. 105:357-67; Yamamoto, et al., (1994) Plant Cell Physiol. 35(5):773-778; Gotor, et al., (1993) Plant J. 3:509-18; Orozco, et al., (1993) Plant Mol. Biol. 23(6):1129-1138 and Matsuoka, et al., (1993) Proc. Natl. Acad. Sci. USA 90(20):9586-9590.

Root-preferred or root-specific promoters are known and can be selected from the many available from the literature or isolated de novo from various compatible species. See, for example, Hire, et al., (1992) Plant Mol. Biol. 20(2):207-218 (soybean root-specific glutamine synthetase gene); Keller and Baumgartner, (1991) Plant Cell 3(10):1051-1061 (root-specific control element in the GRP 1.8 gene of French bean); Sanger, et al., (1990) Plant Mol. Biol. 14(3):433-443 (root-specific promoter of the mannopine synthase (MAS) gene of Agrobacterium tumefaciens) and Miao, et al., (1991) Plant Cell 3(1):11-22 (full-length cDNA clone encoding cytosolic glutamine synthetase (GS), which is expressed in roots and root nodules of soybean). See also, Bogusz, et al., (1990) Plant Cell 2(7):633-641, where two root-specific promoters isolated from hemoglobin genes from the nitrogen-fixing nonlegume Parasponia andersonii and the related non-nitrogen-fixing nonlegume Trema tomentosa are described. The promoters of these genes were linked to a β-glucuronidase reporter gene and introduced into both the nonlegume Nicotiana tabacum and the legume Lotus corniculatus, and in both instances root-specific promoter activity was preserved. Leach and Aoyagi, (1991) describe their analysis of the promoters of the highly expressed roIC and roID root-inducing genes of Agrobacterium rhizogenes (see, Plant Science (Limerick) 79(1):69-76). They concluded that enhancer and tissue-preferred DNA determinants are dissociated in those promoters. Teeri, et al., (1989) used gene fusion to lacZ to show that the Agrobacterium T-DNA gene encoding octopine synthase is especially active in the epidermis of the root tip and that the TR2′ gene is root specific in the intact plant and stimulated by wounding in leaf tissue, an especially desirable combination of characteristics for use with an insecticidal or larvicidal gene (see, EMBO J. 8(2):343-350). The TR1′ gene fused to nptII (neomycin phosphotransferase II) showed similar characteristics. Additional root-preferred promoters include the VfENOD-GRP3 gene promoter (Kuster, et al., (1995) Plant Mol. Biol. 29(4):759-772) and rolB promoter (Capana, et al., (1994) Plant Mol. Biol. 25(4):681-691. See also, U.S. Pat. Nos. 5,837,876; 5,750,386; 5,633,363; 5,459,252; 5,401,836; 5,110,732 and 5,023,179.

“Seed-preferred” promoters include both “seed-specific” promoters (those promoters active during seed development such as promoters of seed storage proteins) as well as “seed-germinating” promoters (those promoters active during seed germination). See, Thompson, et al., (1989) BioEssays 10:108, herein incorporated by reference. Such seed-preferred promoters include, but are not limited to, Cim1 (cytokinin-induced message); cZ19B1 (maize 19 kDa zein); and milps (myo-inositol-1-phosphate synthase) (see, U.S. Pat. No. 6,225,529, herein incorporated by reference). Gamma-zein and Glb-1 are endosperm-specific promoters. For dicots, seed-specific promoters include, but are not limited to, Kunitz trypsin inhibitor 3 (KTi3) (Jofuku, K. D. and Goldberg, R. B. Plant Cell 1:1079-1093, 1989), bean β-phaseolin, napin, β-conglycinin, glycinin 1, soybean lectin, cruciferin, and the like. For monocots, seed-specific promoters include, but are not limited to, maize 15 kDa zein, 22 kDa zein, 27 kDa zein, g-zein, waxy, shrunken 1, shrunken 2, globulin 1, etc. See also, WO 2000/12733, where seed-preferred promoters from end1 and end2 genes are disclosed; herein incorporated by reference. In dicots, seed specific promoters include but are not limited to seed coat promoter from Arabidopsis, pBAN; and the early seed promoters from Arabidopsis, p26, p63, and p63tr (U.S. Pat. Nos. 7,294,760 and 7,847,153). A promoter that has “preferred” expression in a particular tissue is expressed in that tissue to a greater degree than in at least one other plant tissue. Some tissue-preferred promoters show expression almost exclusively in the particular tissue.

Where low level expression is desired, weak promoters will be used. Generally, the term “weak promoter” as used herein refers to a promoter that drives expression of a coding sequence at a low level. By low level expression at levels of about 1/1000 transcripts to about 1/100,000 transcripts to about 1/500,000 transcripts is intended. Alternatively, it is recognized that the term “weak promoters” also encompasses promoters that drive expression in only a few cells and not in others to give a total low level of expression. Where a promoter drives expression at unacceptably high levels, portions of the promoter sequence can be deleted or modified to decrease expression levels.

Such weak constitutive promoters include, for example the core promoter of the Rsyn7 promoter (WO 1999/43838 and U.S. Pat. No. 6,072,050), the core 35S CaMV promoter, and the like. Other constitutive promoters include, for example, those disclosed in U.S. Pat. Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; 5,608,142 and 6,177,611, herein incorporated by reference.

The above list of promoters is not meant to be limiting. Any appropriate promoter can be used in the embodiments.

Generally, the expression cassette will comprise a selectable marker gene for the selection of transformed cells. Selectable marker genes are utilized for the selection of transformed cells or tissues. Marker genes include genes encoding antibiotic resistance, such as those encoding neomycin phosphotransferase II (NEO) and hygromycin phosphotransferase (HPT), as well as genes conferring resistance to herbicidal compounds, such as glufosinate ammonium, bromoxynil, imidazolinones and 2,4-dichlorophenoxyacetate (2,4-D). Additional examples of suitable selectable marker genes include, but are not limited to, genes encoding resistance to chloramphenicol (Herrera Estrella, et al., (1983) EMBO J. 2:987-992); methotrexate (Herrera Estrella, et al., (1983) Nature 303:209-213 and Meijer, et al., (1991) Plant Mol. Biol. 16:807-820); streptomycin (Jones, et al., (1987) Mol. Gen. Genet. 210:86-91); spectinomycin (Bretagne-Sagnard, et al., (1996) Transgenic Res. 5:131-137); bleomycin (Hille, et al., (1990) Plant Mol. Biol. 7:171-176); sulfonamide (Guerineau, et al., (1990) Plant Mol. Biol. 15:127-136); bromoxynil (Stalker, et al., (1988) Science 242:419-423); glyphosate (Shaw, et al., (1986) Science 233:478-481 and U.S. patent application Ser. Nos. 10/004,357 and 10/427,692); phosphinothricin (DeBlock, et al., (1987) EMBO J. 6:2513-2518). See generally, Yarranton, (1992) Curr. Opin. Biotech. 3:506-511; Christopherson, et al., (1992) Proc. Natl. Acad. Sci. USA 89:6314-6318; Yao, et al., (1992) Cell 71:63-72; Reznikoff, (1992) Mol. Microbiol. 6:2419-2422; Barkley, et al., (1980) in The Operon, pp. 177-220; Hu, et al., (1987) Cell 48:555-566; Brown, et al., (1987) Cell 49:603-612; Figge, et al., (1988) Cell 52:713-722; Deuschle, et al., (1989) Proc. Natl. Acad. Sci. USA 86:5400-5404; Fuerst, et al., (1989) Proc. Natl. Acad. Sci. USA 86:2549-2553; Deuschle, et al., (1990) Science 248:480-483; Gossen, (1993) Ph.D. Thesis, University of Heidelberg; Reines, et al., (1993) Proc. Natl. Acad. Sci. USA 90:1917-1921; Labow, et al., (1990) Mol. Cell. Biol. 10:3343-3356; Zambretti, et al., (1992) Proc. Natl. Acad. Sci. USA 89:3952-3956; Baim, et al., (1991) Proc. Natl. Acad. Sci. USA 88:5072-5076; Wyborski, et al., (1991) Nucleic Acids Res. 19:4647-4653; Hillenand-Wissman, (1989) Topics Mol. Struc. Biol. 10:143-162; Degenkolb, et al., (1991) Antimicrob. Agents Chemother. 35:1591-1595; Kleinschnidt, et al., (1988) Biochemistry 27:1094-1104; Bonin, (1993) Ph.D. Thesis, University of Heidelberg; Gossen, et al., (1992) Proc. Natl. Acad. Sci. USA 89:5547-5551; Oliva, et al., (1992) Antimicrob. Agents Chemother. 36:913-919; Hlavka, et al., (1985) Handbook of Experimental Pharmacology, Vol. 78 (Springer-Verlag, Berlin) and Gill, et al., (1988) Nature 334:721-724. Such disclosures are herein incorporated by reference.

The above list of selectable marker genes is not meant to be limiting. Any selectable marker gene can be used in the embodiments.

Plant Transformation

The methods of the embodiments involve introducing a polypeptide or polynucleotide into a plant. “Introducing” is intended to mean presenting to the plant the polynucleotide or polypeptide in such a manner that the sequence gains access to the interior of a cell of the plant. The methods of the embodiments do not depend on a particular method for introducing a polynucleotide or polypeptide into a plant, only that the polynucleotide or polypeptides gains access to the interior of at least one cell of the plant. Methods for introducing polynucleotide or polypeptides into plants are known in the art including, but not limited to, stable transformation methods, transient transformation methods and virus-mediated methods.

“Stable transformation” is intended to mean that the nucleotide construct introduced into a plant integrates into the genome of the plant and is capable of being inherited by the progeny thereof. “Transient transformation” is intended to mean that a polynucleotide is introduced into the plant and does not integrate into the genome of the plant or a polypeptide is introduced into a plant. By “plant” is intended whole plants, plant organs (e.g., leaves, stems, roots, etc.), seeds, plant cells, propagules, embryos and progeny of the same. Plant cells can be differentiated or undifferentiated (e.g. callus, suspension culture cells, protoplasts, leaf cells, root cells, phloem cells, and pollen).

Transformation protocols as well as protocols for introducing nucleotide sequences into plants may vary depending on the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. Suitable methods of introducing nucleotide sequences into plant cells and subsequent insertion into the plant genome include microinjection (Crossway, et al., (1986) Biotechniques 4:320-334), electroporation (Riggs, et al., (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606), Agrobacterium-mediated transformation (U.S. Pat. Nos. 5,563,055 and 5,981,840), direct gene transfer (Paszkowski, et al., (1984) EMBO J. 3:2717-2722) and ballistic particle acceleration (see, for example, U.S. Pat. Nos. 4,945,050; 5,879,918; 5,886,244 and 5,932,782; Tomes, et al., (1995) in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg and Phillips, (Springer-Verlag, Berlin) and McCabe, et al., (1988) Biotechnology 6:923-926) and Lecl transformation (WO 2000/28058). For potato transformation see, Tu, et al., (1998) Plant Molecular Biology 37:829-838 and Chong, et al., (2000) Transgenic Research 9:71-78. Additional transformation procedures can be found in Weissinger, et al., (1988) Ann. Rev. Genet. 22:421-477; Sanford, et al., (1987) Particulate Science and Technology 5:27-37 (onion); Christou, et al., (1988) Plant Physiol. 87:671-674 (soybean); McCabe, et al., (1988) Bio/Technology 6:923-926 (soybean); Finer and McMullen, (1991) In Vitro Cell Dev. Biol. 27P:175-182 (soybean); Singh, et al., (1998) Theor. Appl. Genet. 96:319-324 (soybean); Datta, et al., (1990) Biotechnology 8:736-740 (rice); Klein, et al., (1988) Proc. Natl. Acad. Sci. USA 85:4305-4309 (maize); Klein, et al., (1988) Biotechnology 6:559-563 (maize); U.S. Pat. Nos. 5,240,855; 5,322,783 and 5,324,646; Klein, et al., (1988) Plant Physiol. 91:440-444 (maize); Fromm, et al., (1990) Biotechnology 8:833-839 (maize); Hooykaas-Van Slogteren, et al., (1984) Nature (London) 311:763-764; U.S. Pat. No. 5,736,369 (cereals); Bytebier, et al., (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De Wet, et al., (1985) in The Experimental Manipulation of Ovule Tissues, ed. Chapman, et al., (Longman, N.Y.), pp. 197-209 (pollen); Kaeppler, et al., (1990) Plant Cell Reports 9:415-418 and Kaeppler, et al., (1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); D'Halluin, et al., (1992) Plant Cell 4:1495-1505 (electroporation); Li, et al., (1993) Plant Cell Reports 12:250-255 and Christou and Ford, (1995) Annals of Botany 75:407-413 (rice); Osjoda, et al., (1996) Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens); all of which are herein incorporated by reference.

In specific embodiments, the sequences of the embodiments can be provided to a plant using a variety of transient transformation methods. Such transient transformation methods include, but are not limited to, the introduction of the PHI-4 polypeptide or variants and fragments thereof directly into the plant or the introduction of the PHI-4 polypeptide transcript into the plant. Such methods include, for example, microinjection or particle bombardment. See, for example, Crossway, et al., (1986) Mol Gen. Genet. 202:179-185; Nomura, et al., (1986) Plant Sci. 44:53-58; Hepler, et al., (1994) Proc. Natl. Acad. Sci. 91:2176-2180 and Hush, et al., (1994) The Journal of Cell Science 107:775-784, all of which are herein incorporated by reference. Alternatively, the PHI-4 polypeptide polynucleotide can be transiently transformed into the plant using techniques known in the art. Such techniques include viral vector system and the precipitation of the polynucleotide in a manner that precludes subsequent release of the DNA. Thus, transcription from the particle-bound DNA can occur, but the frequency with which it is released to become integrated into the genome is greatly reduced. Such methods include the use of particles coated with polyethylimine (PEI; Sigma #P3143).

Methods are known in the art for the targeted insertion of a polynucleotide at a specific location in the plant genome. In one embodiment, the insertion of the polynucleotide at a desired genomic location is achieved using a site-specific recombination system. See, for example, WO 1999/25821, WO 1999/25854, WO 1999/25840, WO 1999/25855 and WO 1999/25853, all of which are herein incorporated by reference. Briefly, the polynucleotide of the embodiments can be contained in transfer cassette flanked by two non-identical recombination sites. The transfer cassette is introduced into a plant have stably incorporated into its genome a target site which is flanked by two non-identical recombination sites that correspond to the sites of the transfer cassette. An appropriate recombinase is provided and the transfer cassette is integrated at the target site. The polynucleotide of interest is thereby integrated at a specific chromosomal position in the plant genome.

Plant transformation vectors may be comprised of one or more DNA vectors needed for achieving plant transformation. For example, it is a common practice in the art to utilize plant transformation vectors that are comprised of more than one contiguous DNA segment. These vectors are often referred to in the art as “binary vectors”. Binary vectors as well as vectors with helper plasmids are most often used for Agrobacterium-mediated transformation, where the size and complexity of DNA segments needed to achieve efficient transformation is quite large, and it is advantageous to separate functions onto separate DNA molecules. Binary vectors typically contain a plasmid vector that contains the cis-acting sequences required for T-DNA transfer (such as left border and right border), a selectable marker that is engineered to be capable of expression in a plant cell, and a “gene of interest” (a gene engineered to be capable of expression in a plant cell for which generation of transgenic plants is desired). Also present on this plasmid vector are sequences required for bacterial replication. The cis-acting sequences are arranged in a fashion to allow efficient transfer into plant cells and expression therein. For example, the selectable marker gene and the pesticidal gene are located between the left and right borders. Often a second plasmid vector contains the trans-acting factors that mediate T-DNA transfer from Agrobacterium to plant cells. This plasmid often contains the virulence functions (Vir genes) that allow infection of plant cells by Agrobacterium, and transfer of DNA by cleavage at border sequences and vir-mediated DNA transfer, as is understood in the art (Hellens and Mullineaux, (2000) Trends in Plant Science 5:446-451). Several types of Agrobacterium strains (e.g. LBA4404, GV3101, EHA101, EHA105, etc.) can be used for plant transformation. The second plasmid vector is not necessary for transforming the plants by other methods such as microprojection, microinjection, electroporation, polyethylene glycol, etc.

In general, plant transformation methods involve transferring heterologous DNA into target plant cells (e.g., immature or mature embryos, suspension cultures, undifferentiated callus, protoplasts, etc.), followed by applying a maximum threshold level of appropriate selection (depending on the selectable marker gene) to recover the transformed plant cells from a group of untransformed cell mass. Following integration of heterologous foreign DNA into plant cells, one then applies a maximum threshold level of appropriate selection in the medium to kill the untransformed cells and separate and proliferate the putatively transformed cells that survive from this selection treatment by transferring regularly to a fresh medium. By continuous passage and challenge with appropriate selection, one identifies and proliferates the cells that are transformed with the plasmid vector. Molecular and biochemical methods can then be used to confirm the presence of the integrated heterologous gene of interest into the genome of the transgenic plant.

Explants are typically transferred to a fresh supply of the same medium and cultured routinely. Subsequently, the transformed cells are differentiated into shoots after placing on regeneration medium supplemented with a maximum threshold level of selecting agent. The shoots are then transferred to a selective rooting medium for recovering rooted shoot or plantlet. The transgenic plantlet then grows into a mature plant and produces fertile seeds (e.g., Hiei, et al., (1994) The Plant Journal 6:271-282; Ishida, et al., (1996) Nature Biotechnology 14:745-750). Explants are typically transferred to a fresh supply of the same medium and cultured routinely. A general description of the techniques and methods for generating transgenic plants are found in Ayres and Park, (1994) Critical Reviews in Plant Science 13:219-239 and Bommineni and Jauhar, (1997) Maydica 42:107-120. Since the transformed material contains many cells; both transformed and non-transformed cells are present in any piece of subjected target callus or tissue or group of cells. The ability to kill non-transformed cells and allow transformed cells to proliferate results in transformed plant cultures. Often, the ability to remove non-transformed cells is a limitation to rapid recovery of transformed plant cells and successful generation of transgenic plants.

The cells that have been transformed may be grown into plants in accordance with conventional ways. See, for example, McCormick, et al., (1986) Plant Cell Reports 5:81-84. These plants may then be grown, and either pollinated with the same transformed strain or different strains and the resulting hybrid having constitutive or inducible expression of the desired phenotypic characteristic identified. Two or more generations may be grown to ensure that expression of the desired phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure that expression of the desired phenotypic characteristic has been achieved.

The nucleotide sequences of the embodiments may be provided to the plant by contacting the plant with a virus or viral nucleic acids. Generally, such methods involve incorporating the nucleotide construct of interest within a viral DNA or RNA molecule. It is recognized that the recombinant proteins of the embodiments may be initially synthesized as part of a viral polyprotein, which later may be processed by proteolysis in vivo or in vitro to produce the desired PHI-4 polypeptide. It is also recognized that such a viral polyprotein, comprising at least a portion of the amino acid sequence of a PHI-4 polypeptide of the embodiments, may have the desired pesticidal activity. Such viral polyproteins and the nucleotide sequences that encode for them are encompassed by the embodiments. Methods for providing plants with nucleotide constructs and producing the encoded proteins in the plants, which involve viral DNA or RNA molecules are known in the art. See, for example, U.S. Pat. Nos. 5,889,191; 5,889,190; 5,866,785; 5,589,367 and 5,316,931, herein incorporated by reference.

Methods for transformation of chloroplasts are known in the art. See, for example, Svab, et al., (1990) Proc. Natl. Acad. Sci. USA 87:8526-8530; Svab and Maliga, (1993) Proc. Natl. Acad. Sci. USA 90:913-917; Svab and Maliga, (1993) EMBO J. 12:601-606. The method relies on particle gun delivery of DNA containing a selectable marker and targeting of the DNA to the plastid genome through homologous recombination. Additionally, plastid transformation can be accomplished by transactivation of a silent plastid-borne transgene by tissue-preferred expression of a nuclear-encoded and plastid-directed RNA polymerase. Such a system has been reported in McBride, et al., (1994) Proc. Natl. Acad. Sci. USA 91:7301-7305.

The embodiments further relate to plant-propagating material of a transformed plant of the embodiments including, but not limited to, seeds, tubers, corms, bulbs, leaves, and cuttings of roots and shoots.

The embodiments may be used for transformation of any plant species, including, but not limited to, monocots and dicots. Examples of plants of interest include, but are not limited to, corn (Zea mays), Brassica sp. (e.g., B. napus, B. rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea batatus), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, barley, vegetables, ornamentals and conifers.

Vegetables include tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), and musk melon (C. melo). Ornamentals include azalea (Rhododendron spp.), hydrangea (Macrophylla hydrangea), hibiscus (Hibiscus rosasanensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), poinsettia (Euphorbia pulcherrima), and chrysanthemum. Conifers that may be employed in practicing the embodiments include, for example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliotii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta) and Monterey pine (Pinus radiata); Douglas-fir (Pseudotsuga menziesii); Western hemlock (Tsuga canadensis); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis nootkatensis). Plants of the embodiments include crop plants (for example, corn, alfalfa, sunflower, Brassica, soybean, cotton, safflower, peanut, sorghum, wheat, millet, tobacco, etc.), such as corn and soybean plants.

Turf grasses include, but are not limited to: annual bluegrass (Poa annua); annual ryegrass (Lolium multiflorum); Canada bluegrass (Poa compressa); Chewing's fescue (Festuca rubra); colonial bentgrass (Agrostis tenuis); creeping bentgrass (Agrostis palustris); crested wheatgrass (Agropyron desertorum); fairway wheatgrass (Agropyron cristatum); hard fescue (Festuca longifolia); Kentucky bluegrass (Poa pratensis); orchardgrass (Dactylis glomerata); perennial ryegrass (Lolium perenne); red fescue (Festuca rubra); redtop (Agrostis alba); rough bluegrass (Poa trivialis); sheep fescue (Festuca ovina); smooth bromegrass (Bromus inermis); tall fescue (Festuca arundinacea); timothy (Phleum pratense); velvet bentgrass (Agrostis canina); weeping alkaligrass (Puccinellia distans); western wheatgrass (Agropyron smithii); Bermuda grass (Cynodon spp.); St. Augustine grass (Stenotaphrum secundatum); zoysia grass (Zoysia spp.); Bahia grass (Paspalum notatum); carpet grass (Axonopus affinis); centipede grass (Eremochloa ophiuroides); kikuyu grass (Pennisetum clandesinum); seashore paspalum (Paspalum vaginatum); blue gramma (Bouteloua gracilis); buffalo grass (Buchloe dactyloids); sideoats gramma (Bouteloua curtipendula).

Plants of interest include grain plants that provide seeds of interest, oil-seed plants, and leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, rice, sorghum, rye, millet, etc. Oil-seed plants include cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, flax, castor, olive etc. Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, etc.

Evaluation of Plant Transformation

Following introduction of heterologous foreign DNA into plant cells, the transformation or integration of heterologous gene in the plant genome is confirmed by various methods such as analysis of nucleic acids, proteins and metabolites associated with the integrated gene.

PCR analysis is a rapid method to screen transformed cells, tissue or shoots for the presence of incorporated gene at the earlier stage before transplanting into the soil (Sambrook and Russell, (2001) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.). PCR is carried out using oligonucleotide primers specific to the gene of interest or Agrobacterium vector background, etc.

Plant transformation may be confirmed by Southern blot analysis of genomic DNA (Sambrook and Russell, (2001) supra). In general, total DNA is extracted from the transformant, digested with appropriate restriction enzymes, fractionated in an agarose gel and transferred to a nitrocellulose or nylon membrane. The membrane or “blot” is then probed with, for example, radiolabeled 32P target DNA fragment to confirm the integration of introduced gene into the plant genome according to standard techniques (Sambrook and Russell, (2001) supra).

In Northern blot analysis, RNA is isolated from specific tissues of transformant, fractionated in a formaldehyde agarose gel, and blotted onto a nylon filter according to standard procedures that are routinely used in the art (Sambrook and Russell, (2001) supra). Expression of RNA encoded by the pesticidal gene is then tested by hybridizing the filter to a radioactive probe derived from a pesticidal gene, by methods known in the art (Sambrook and Russell, (2001) supra).

Western blot, biochemical assays and the like may be carried out on the transgenic plants to confirm the presence of protein encoded by the pesticidal gene by standard procedures (Sambrook and Russell, 2001, supra) using antibodies that bind to one or more epitopes present on the PHI-4 polypeptide.

Stacking of Traits in Transgenic Plant

Transgenic plants may comprise a stack of one or more insecticidal polynucleotides disclosed herein with one or more additional polynucleotides resulting in the production or suppression of multiple polypeptide sequences. Transgenic plants comprising stacks of polynucleotide sequences can be obtained by either or both of traditional breeding methods or through genetic engineering methods. These methods include, but are not limited to, breeding individual lines each comprising a polynucleotide of interest, transforming a transgenic plant comprising a gene disclosed herein with a subsequent gene, and co-transformation of genes into a single plant cell. As used herein, the term “stacked” includes having the multiple traits present in the same plant (i.e., both traits are incorporated into the nuclear genome, one trait is incorporated into the nuclear genome and one trait is incorporated into the genome of a plastid, or both traits are incorporated into the genome of a plastid). In one non-limiting example, “stacked traits” comprise a molecular stack where the sequences are physically adjacent to each other. A trait, as used herein, refers to the phenotype derived from a particular sequence or groups of sequences. Co-transformation of genes can be carried out using single transformation vectors comprising multiple genes or genes carried separately on multiple vectors. If the sequences are stacked by genetically transforming the plants, the polynucleotide sequences of interest can be combined at any time and in any order. The traits can be introduced simultaneously in a co-transformation protocol with the polynucleotides of interest provided by any combination of transformation cassettes. For example, if two sequences will be introduced, the two sequences can be contained in separate transformation cassettes (trans) or contained on the same transformation cassette (cis). Expression of the sequences can be driven by the same promoter or by different promoters. In certain cases, it may be desirable to introduce a transformation cassette that will suppress the expression of the polynucleotide of interest. This may be combined with any combination of other suppression cassettes or overexpression cassettes to generate the desired combination of traits in the plant. It is further recognized that polynucleotide sequences can be stacked at a desired genomic location using a site-specific recombination system. See, for example, WO 1999/25821, WO 1999/25854, WO 1999/25840, WO 1999/25855 and WO 1999/25853, all of which are herein incorporated by reference.

In some embodiments the polynucleotides encoding the PHI-4 polypeptides disclosed herein, alone or stacked with one or more additional insect resistance traits can be stacked with one or more additional input traits (e.g., herbicide resistance, fungal resistance, virus resistance or stress tolerance, disease resistance, male sterility, stalk strength, and the like) or output traits (e.g., increased yield, modified starches, improved oil profile, balanced amino acids, high lysine or methionine, increased digestibility, improved fiber quality, drought resistance, and the like). Thus, the polynucleotide embodiments can be used to provide a complete agronomic package of improved crop quality with the ability to flexibly and cost effectively control any number of agronomic pests.

Transgenes useful for stacking include but are not limited to:

1. Transgenes that Confer Resistance to Insects or Disease and that Encode:

(A) Plant disease resistance genes. Plant defenses are often activated by specific interaction between the product of a disease resistance gene (R) in the plant and the product of a corresponding avirulence (Avr) gene in the pathogen. A plant variety can be transformed with cloned resistance gene to engineer plants that are resistant to specific pathogen strains. See, for example, Jones, et al., (1994) Science 266:789 (cloning of the tomato Cf-9 gene for resistance to Cladosporium fulvum); Martin, et al., (1993) Science 262:1432 (tomato Pto gene for resistance to Pseudomonas syringae pv. tomato encodes a protein kinase); Mindrinos, et al., (1994) Cell 78:1089 (Arabidopsis RSP2 gene for resistance to Pseudomonas syringae), McDowell and Woffenden, (2003) Trends Biotechnol. 21(4):178-83 and Toyoda, et al., (2002) Transgenic Res. 11(6):567-82. A plant resistant to a disease is one that is more resistant to a pathogen as compared to the wild type plant.

(B) Genes encoding a Bacillus thuringiensis protein, a derivative thereof or a synthetic polypeptide modeled thereon. See, for example, Geiser, et al., (1986) Gene 48:109, who disclose the cloning and nucleotide sequence of a Bt delta-endotoxin gene. Moreover, DNA molecules encoding delta-endotoxin genes can be purchased from American Type Culture Collection (Rockville, Md.), for example, under ATCC Accession Numbers 40098, 67136, 31995 and 31998. Other non-limiting examples of Bacillus thuringiensis transgenes being genetically engineered are given in the following patents and patent applications and hereby are incorporated by reference for this purpose: U.S. Pat. Nos. 5,188,960; 5,689,052; 5,880,275; 5,986,177; 6,023,013, 6,060,594, 6,063,597, 6,077,824, 6,620,988, 6,642,030, 6,713,259, 6,893,826, 7,105,332; 7,179,965, 7,208,474; 7,227,056, 7,288,643, 7,323,556, 7,329,736, 7,449,552, 7,468,278, 7,510,878, 7,521,235, 7,544,862, 7,605,304, 7,696,412, 7,629,504, 7,705,216, 7,772,465, 7,790,846, 7,858,849 and WO 1991/14778; WO 1999/31248; WO 2001/12731; WO 1999/24581 and WO 1997/40162.

(C) A polynucleotide encoding an insect-specific hormone or pheromone such as an ecdysteroid and juvenile hormone, a variant thereof, a mimetic based thereon or an antagonist or agonist thereof. See, for example, the disclosure by Hammock, et al., (1990) Nature 344:458, of baculovirus expression of cloned juvenile hormone esterase, an inactivator of juvenile hormone.

(D) A polynucleotide encoding an insect-specific peptide which, upon expression, disrupts the physiology of the affected pest. For example, see the disclosures of, Regan, (1994) J. Biol. Chem. 269:9 (expression cloning yields DNA coding for insect diuretic hormone receptor); Pratt, et al., (1989) Biochem. Biophys. Res. Comm. 163:1243 (an allostatin is identified in Diploptera puntata); Chattopadhyay, et al., (2004) Critical Reviews in Microbiology 30(1):33-54; Zjawiony, (2004) J Nat Prod 67(2):300-310; Carlini and Grossi-de-Sa, (2002) Toxicon 40(11):1515-1539; Ussuf, et al., (2001) Curr 80(7):847-853 and Vasconcelos and Oliveira, (2004) Toxicon 44(4):385-403. See also, U.S. Pat. No. 5,266,317 to Tomalski, et al., who disclose genes encoding insect-specific toxins.

(E) A polynucleotide encoding an enzyme responsible for a hyperaccumulation of a monoterpene, a sesquiterpene, a steroid, hydroxamic acid, a phenylpropanoid derivative or another non-protein molecule with insecticidal activity.

(F) A polynucleotide encoding an enzyme involved in the modification, including the post-translational modification, of a biologically active molecule; for example, a glycolytic enzyme, a proteolytic enzyme, a lipolytic enzyme, a nuclease, a cyclase, a transaminase, an esterase, a hydrolase, a phosphatase, a kinase, a phosphorylase, a polymerase, an elastase, a chitinase and a glucanase, whether natural or synthetic. See, PCT Application WO 1993/02197 in the name of Scott, et al., which discloses the nucleotide sequence of a callase gene. DNA molecules which contain chitinase-encoding sequences can be obtained, for example, from the ATCC under Accession Numbers 39637 and 67152. See also, Kramer, et al., (1993) Insect Biochem. Molec. Biol. 23:691, who teach the nucleotide sequence of a cDNA encoding tobacco hookworm chitinase and Kawalleck, et al., (1993) Plant Molec. Biol. 21:673, who provide the nucleotide sequence of the parsley ubi4-2 polyubiquitin gene, and U.S. Pat. Nos. 6,563,020; 7,145,060 and 7,087,810.

(G) A polynucleotide encoding a molecule that stimulates signal transduction. For example, see the disclosure by Botella, et al., (1994) Plant Molec. Biol. 24:757, of nucleotide sequences for mung bean calmodulin cDNA clones, and Griess, et al., (1994) Plant Physiol. 104:1467, who provide the nucleotide sequence of a maize calmodulin cDNA clone.

(H) A polynucleotide encoding a hydrophobic moment peptide. See, PCT Application WO 1995/16776 and U.S. Pat. No. 5,580,852 disclosure of peptide derivatives of Tachyplesin which inhibit fungal plant pathogens) and PCT Application WO 1995/18855 and U.S. Pat. No. 5,607,914 (teaches synthetic antimicrobial peptides that confer disease resistance).

(I) A polynucleotide encoding a membrane permease, a channel former or a channel blocker. For example, see the disclosure by Jaynes, et al., (1993) Plant Sci. 89:43, of heterologous expression of a cecropin-beta lytic peptide analog to render transgenic tobacco plants resistant to Pseudomonas solanacearum.

(J) A gene encoding a viral-invasive protein or a complex toxin derived therefrom. For example, the accumulation of viral coat proteins in transformed plant cells imparts resistance to viral infection and/or disease development effected by the virus from which the coat protein gene is derived, as well as by related viruses. See, Beachy, et al., (1990) Ann. Rev. Phytopathol. 28:451. Coat protein-mediated resistance has been conferred upon transformed plants against alfalfa mosaic virus, cucumber mosaic virus, tobacco streak virus, potato virus X, potato virus Y, tobacco etch virus, tobacco rattle virus and tobacco mosaic virus. Id.

(K) A gene encoding an insect-specific antibody or an immunotoxin derived therefrom. Thus, an antibody targeted to a critical metabolic function in the insect gut would inactivate an affected enzyme, killing the insect. Cf. Taylor, et al., Abstract #497, SEVENTH INT'L SYMPOSIUM ON MOLECULAR PLANT-MICROBE INTERACTIONS (Edinburgh, Scotland, 1994) (enzymatic inactivation in transgenic tobacco via production of single-chain antibody fragments).

(L) A gene encoding a virus-specific antibody. See, for example, Tavladoraki, et al., (1993) Nature 366:469, who show that transgenic plants expressing recombinant antibody genes are protected from virus attack.

(M) A polynucleotide encoding a developmental-arrestive protein produced in nature by a pathogen or a parasite. Thus, fungal endo alpha-1,4-D-polygalacturonases facilitate fungal colonization and plant nutrient release by solubilizing plant cell wall homo-alpha-1,4-D-galacturonase. See, Lamb, et al., (1992) Bio/Technology 10:1436. The cloning and characterization of a gene which encodes a bean endopolygalacturonase-inhibiting protein is described by Toubart, et al., (1992) Plant J. 2:367.

(N) A polynucleotide encoding a developmental-arrestive protein produced in nature by a plant. For example, Logemann, et al., (1992) Bio/Technology 10:305, have shown that transgenic plants expressing the barley ribosome-inactivating gene have an increased resistance to fungal disease.

(O) Genes involved in the Systemic Acquired Resistance (SAR) Response and/or the pathogenesis related genes. Briggs, (1995) Current Biology 5(2), Pieterse and Van Loon, (2004) Curr. Opin. Plant Bio. 7(4):456-64 and Somssich, (2003) Cell 113(7):815-6.

(P) Antifungal genes (Cornelissen and Melchers, (1993) Pl. Physiol. 101:709-712 and Parijs, et al., (1991) Planta 183:258-264 and Bushnell, et al., (1998) Can. J. of Plant Path. 20(2):137-149. Also see, U.S. patent application Ser. Nos. 09/950,933; 11/619,645; 11/657,710; 11/748,994; 11/774,121 and U.S. Pat. Nos. 6,891,085 and 7,306,946. LysM Receptor-like kinases for the perception of chitin fragments as a first step in plant defense response against fungal pathogens (US 2012/0110696).

(Q) Detoxification genes, such as for fumonisin, beauvericin, moniliformin and zearalenone and their structurally related derivatives. For example, see, U.S. Pat. Nos. 5,716,820; 5,792,931; 5,798,255; 5,846,812; 6,083,736; 6,538,177; 6,388,171 and 6,812,380.

(R) A polynucleotide encoding a Cystatin and cysteine proteinase inhibitors. See, U.S. Pat. No. 7,205,453.

(S) Defensin genes. See, WO 2003/000863 and U.S. Pat. Nos. 6,911,577; 6,855,865; 6,777,592 and 7,238,781.

(T) Genes conferring resistance to nematodes. See, e.g., PCT Application WO 1996/30517; PCT Application WO 1993/19181, WO 2003/033651 and Urwin, et al., (1998) Planta 204:472-479, Williamson, (1999) Curr Opin Plant Bio. 2(4):327-31; U.S. Pat. Nos. 6,284,948 and 7,301,069 and miR164 genes (WO 2012/058266).

(U) Genes that confer resistance to Phytophthora Root Rot, such as the Rps 1, Rps 1-a, Rps 1-b, Rps 1-c, Rps 1-d, Rps 1-e, Rps 1-k, Rps 2, Rps 3-a, Rps 3-b, Rps 3-c, Rps 4, Rps 5, Rps 6, Rps 7 and other Rps genes. See, for example, Shoemaker, et al., Phytophthora Root Rot Resistance Gene Mapping in Soybean, Plant Genome IV Conference, San Diego, Calif. (1995).

(V) Genes that confer resistance to Brown Stem Rot, such as described in U.S. Pat. No. 5,689,035 and incorporated by reference for this purpose.

(W) Genes that confer resistance to Colletotrichum, such as described in US Patent Application Publication US 2009/0035765 and incorporated by reference for this purpose. This includes the Rcg locus that may be utilized as a single locus conversion.

2. Transgenes that Confer Resistance to a Herbicide, for Example:

(A) A polynucleotide encoding resistance to a herbicide that inhibits the growing point or meristem, such as an imidazolinone or a sulfonylurea. Exemplary genes in this category code for mutant ALS and AHAS enzyme as described, for example, by Lee, et al., (1988) EMBO J. 7:1241 and Miki, et al., (1990) Theor. Appl. Genet. 80:449, respectively. See also, U.S. Pat. Nos. 5,605,011; 5,013,659; 5,141,870; 5,767,361; 5,731,180; 5,304,732; 4,761,373; 5,331,107; 5,928,937 and 5,378,824; U.S. Pat. application Ser. No. 11/683,737 and International Publication WO 1996/33270.

(B) A polynucleotide encoding a protein for resistance to Glyphosate (resistance imparted by mutant 5-enolpyruvl-3-phosphikimate synthase (EPSP) and aroA genes, respectively) and other phosphono compounds such as glufosinate (phosphinothricin acetyl transferase (PAT) and Streptomyces hygroscopicus phosphinothricin acetyl transferase (bar) genes), and pyridinoxy or phenoxy proprionic acids and cyclohexones (ACCase inhibitor-encoding genes). See, for example, U.S. Pat. No. 4,940,835 to Shah, et al., which discloses the nucleotide sequence of a form of EPSPS which can confer glyphosate resistance. U.S. Pat. No. 5,627,061 to Barry, et al., also describes genes encoding EPSPS enzymes. See also, U.S. Pat. Nos. 6,566,587; 6,338,961; 6,248,876 B1; 6,040,497; 5,804,425; 5,633,435; 5,145,783; 4,971,908; 5,312,910; 5,188,642; 5,094,945, 4,940,835; 5,866,775; 6,225,114 B1; 6,130,366; 5,310,667; 4,535,060; 4,769,061; 5,633,448; 5,510,471; Re. 36,449; RE 37,287 E and U.S. Pat. No. 5,491,288 and International Publications EP 1173580; WO 2001/66704; EP 1173581 and EP 1173582, which are incorporated herein by reference for this purpose. Glyphosate resistance is also imparted to plants that express a gene encoding a glyphosate oxido-reductase enzyme as described more fully in U.S. Pat. Nos. 5,776,760 and 5,463,175, which are incorporated herein by reference for this purpose. In addition glyphosate resistance can be imparted to plants by the over expression of genes encoding glyphosate N-acetyltransferase. See, for example, U.S. Pat. Nos. 7,462,481; 7,405,074 and US Patent Application Publication Number US 2008/0234130. A DNA molecule encoding a mutant aroA gene can be obtained under ATCC Accession Number 39256, and the nucleotide sequence of the mutant gene is disclosed in U.S. Pat. No. 4,769,061 to Comai. EP Application Number 0 333 033 to Kumada, et al., and U.S. Pat. No. 4,975,374 to Goodman, et al., disclose nucleotide sequences of glutamine synthetase genes which confer resistance to herbicides such as L-phosphinothricin. The nucleotide sequence of a phosphinothricin-acetyl-transferase gene is provided in EP Application Numbers 0 242 246 and 0 242 236 to Leemans, et al., De Greef, et al., (1989) Bio/Technology 7:61, describe the production of transgenic plants that express chimeric bar genes coding for phosphinothricin acetyl transferase activity. See also, U.S. Pat. Nos. 5,969,213; 5,489,520; 5,550,318; 5,874,265; 5,919,675; 5,561,236; 5,648,477; 5,646,024; 6,177,616 B1 and 5,879,903, which are incorporated herein by reference for this purpose. Exemplary genes conferring resistance to phenoxy proprionic acids and cyclohexones, such as sethoxydim and haloxyfop, are the Acc1-S1, Acc1-S2 and Acc1-S3 genes described by Marshall, et al., (1992) Theor. Appl. Genet. 83:435.

(C) A polynucleotide encoding a protein for resistance to herbicide that inhibits photosynthesis, such as a triazine (psbA and gs+genes) and a benzonitrile (nitrilase gene). Przibilla, et al., (1991) Plant Cell 3:169, describe the transformation of Chlamydomonas with plasmids encoding mutant psbA genes. Nucleotide sequences for nitrilase genes are disclosed in U.S. Pat. No. 4,810,648 to Stalker and DNA molecules containing these genes are available under ATCC Accession Numbers 53435, 67441 and 67442. Cloning and expression of DNA coding for a glutathione S-transferase is described by Hayes, et al., (1992) Biochem. J. 285:173.

(D) A polynucleotide encoding a protein for resistance to Acetohydroxy acid synthase, which has been found to make plants that express this enzyme resistant to multiple types of herbicides, has been introduced into a variety of plants (see, e.g., Hattori, et al., (1995) Mol Gen Genet. 246:419). Other genes that confer resistance to herbicides include: a gene encoding a chimeric protein of rat cytochrome P4507A1 and yeast NADPH-cytochrome P450 oxidoreductase (Shiota, et al., (1994) Plant Physiol 106:17), genes for glutathione reductase and superoxide dismutase (Aono, et al., (1995) Plant Cell Physiol 36:1687) and genes for various phosphotransferases (Datta, et al., (1992) Plant Mol Biol 20:619).

(E) A polynucleotide encoding resistance to a herbicide targeting Protoporphyrinogen oxidase (protox) which is necessary for the production of chlorophyll. The protox enzyme serves as the target for a variety of herbicidal compounds. These herbicides also inhibit growth of all the different species of plants present, causing their total destruction. The development of plants containing altered protox activity which are resistant to these herbicides are described in U.S. Pat. Nos. 6,288,306 B1; 6,282,837 B1 and 5,767,373 and International Publication WO 2001/12825.

(F) The aad-1 gene (originally from Sphingobium herbicidovorans) encodes the aryloxyalkanoate dioxygenase (AAD-1) protein. The trait confers tolerance to 2,4-dichlorophenoxyacetic acid and aryloxyphenoxypropionate (commonly referred to as “fop” herbicides such as quizalofop) herbicides. The aad-1 gene, itself, for herbicide tolerance in plants was first disclosed in WO 2005/107437 (see also, US 2009/0093366). The aad-12 gene, derived from Delftia acidovorans, which encodes the aryloxyalkanoate dioxygenase (AAD-12) protein that confers tolerance to 2,4-dichlorophenoxyacetic acid and pyridyloxyacetate herbicides by deactivating several herbicides with an aryloxyalkanoate moiety, including phenoxy auxin (e.g., 2,4-D, MCPA), as well as pyridyloxy auxins (e.g., fluroxypyr, triclopyr).

(G) A polynucleotide encoding a herbicide resistant dicamba monooxygenase disclosed in US Patent Application Publication 2003/0135879 for imparting dicamba tolerance;

(H) A polynucleotide molecule encoding bromoxynil nitrilase (Bxn) disclosed in U.S. Pat. No. 4,810,648 for imparting bromoxynil tolerance;

(I) A polynucleotide molecule encoding phytoene (crtl) described in Misawa, et al., (1993) Plant J. 4:833-840 and in Misawa, et al., (1994) Plant J. 6:481-489 for norflurazon tolerance.

3. Transgenes that Confer or Contribute to an Altered Grain Characteristic

Such as:

(A) Altered fatty acids, for example, by

(1) Down-regulation of stearoyl-ACP to increase stearic acid content of the plant. See, Knultzon, et al., (1992) Proc. Natl. Acad. Sci. USA 89:2624 and WO 1999/64579 (Genes to Alter Lipid Profiles in Corn).

(2) Elevating oleic acid via FAD-2 gene modification and/or decreasing linolenic acid via FAD-3 gene modification (see, U.S. Pat. Nos. 6,063,947; 6,323,392; 6,372,965 and WO 1993/11245).

(3) Altering conjugated linolenic or linoleic acid content, such as in WO 2001/12800.

(4) Altering LEC1, AGP, Dek1, Superal1, mi1 ps, various Ipa genes such as Ipa1, Ipa3, hpt or hggt. For example, see, WO 2002/42424, WO 1998/22604, WO 2003/011015, WO 2002/057439, WO 2003/011015, U.S. Pat. Nos. 6,423,886, 6,197,561, 6,825,397 and US Patent Application Publication Numbers US 2003/0079247, US 2003/0204870 and Rivera-Madrid, et al., (1995) Proc. Natl. Acad. Sci. 92:5620-5624.

(5) Genes encoding delta-8 desaturase for making long-chain polyunsaturated fatty acids (U.S. Pat. Nos. 8,058,571 and 8,338,152), delta-9 desaturase for lowering saturated fats (U.S. Pat. No. 8,063,269), Primula Δ6-desaturase for improving omega-3 fatty acid profiles.

(6) Isolated nucleic acids and proteins associated with lipid and sugar metabolism regulation, in particular, lipid metabolism protein (LMP) used in methods of producing transgenic plants and modulating levels of seed storage compounds including lipids, fatty acids, starches or seed storage proteins and use in methods of modulating the seed size, seed number, seed weights, root length and leaf size of plants (EP 2404499).

(7) Altering expression of a High-Level Expression of Sugar-Inducible 2 (HSI2) protein in the plant to increase or decrease expression of HSI2 in the plant. Increasing expression of HSI2 increases oil content while decreasing expression of HSI2 decreases abscisic acid sensitivity and/or increases drought resistance (US Patent Application Publication Number 2012/0066794).

(8) Expression of cytochrome b5 (Cb5) alone or with FAD2 to modulate oil content in plant seed, particular to increase the levels of omega-3 fatty acids and improve the ratio of omega-6 to omega-3 fatty acids (US Patent Application Publication Number 2011/0191904).

(9) Nucleic acid molecules encoding wrinkled1-like polypeptides for modulating sugar metabolism (U.S. Pat. No. 8,217,223).

(B) Altered phosphorus content, for example, by the

(1) Introduction of a phytase-encoding gene would enhance breakdown of phytate, adding more free phosphate to the transformed plant. For example, see, Van Hartingsveldt, et al., (1993) Gene 127:87, for a disclosure of the nucleotide sequence of an Aspergillus niger phytase gene.

(2) Modulating a gene that reduces phytate content. In maize, this, for example, could be accomplished, by cloning and then re-introducing DNA associated with one or more of the alleles, such as the LPA alleles, identified in maize mutants characterized by low levels of phytic acid, such as in WO 2005/113778 and/or by altering inositol kinase activity as in WO 2002/059324, US Patent Application Publication Number 2003/0009011, WO 2003/027243, US Patent Application Publication Number 2003/0079247, WO 1999/05298, U.S. Pat. Nos. 6,197,561, 6,291,224, 6,391,348, WO 2002/059324, US Patent Application Publication Number 2003/0079247, WO 1998/45448, WO 1999/55882, WO 2001/04147.

(C) Altered carbohydrates affected, for example, by altering a gene for an enzyme that affects the branching pattern of starch or, a gene altering thioredoxin such as NTR and/or TRX (see, U.S. Pat. No. 6,531,648. which is incorporated by reference for this purpose) and/or a gamma zein knock out or mutant such as cs27 or TUSC27 or en27 (see, U.S. Pat. No. 6,858,778 and US Patent Application Publication Number 2005/0160488, US Patent Application Publication Number 2005/0204418, which are incorporated by reference for this purpose). See, Shiroza, et al., (1988) J. Bacteriol. 170:810 (nucleotide sequence of Streptococcus mutant fructosyltransferase gene), Steinmetz, et al., (1985) Mol. Gen. Genet. 200:220 (nucleotide sequence of Bacillus subtilis levansucrase gene), Pen, et al., (1992) Bio/Technology 10:292 (production of transgenic plants that express Bacillus licheniformis alpha-amylase), Elliot, et al., (1993) Plant Molec. Biol. 21:515 (nucleotide sequences of tomato invertase genes), Søgaard, et al., (1993) J. Biol. Chem. 268:22480 (site-directed mutagenesis of barley alpha-amylase gene) and Fisher, et al., (1993) Plant Physiol. 102:1045 (maize endosperm starch branching enzyme II), WO 1999/10498 (improved digestibility and/or starch extraction through modification of UDP-D-xylose 4-epimerase, Fragile 1 and 2, Ref1, HCHL, C4H), U.S. Pat. No. 6,232,529 (method of producing high oil seed by modification of starch levels (AGP)). The fatty acid modification genes mentioned herein may also be used to affect starch content and/or composition through the interrelationship of the starch and oil pathways.

(D) Altered antioxidant content or composition, such as alteration of tocopherol or tocotrienols. For example, see, U.S. Pat. No. 6,787,683, US Patent Application Publication Number 2004/0034886 and WO 2000/68393 involving the manipulation of antioxidant levels and WO 2003/082899 through alteration of a homogentisate geranyl geranyl transferase (hggt).

(E) Altered essential seed amino acids. For example, see, U.S. Pat. No. 6,127,600 (method of increasing accumulation of essential amino acids in seeds), U.S. Pat. No. 6,080,913 (binary methods of increasing accumulation of essential amino acids in seeds), U.S. Pat. No. 5,990,389 (high lysine), WO 1999/40209 (alteration of amino acid compositions in seeds), WO 1999/29882 (methods for altering amino acid content of proteins), U.S. Pat. No. 5,850,016 (alteration of amino acid compositions in seeds), WO 1998/20133 (proteins with enhanced levels of essential amino acids), U.S. Pat. No. 5,885,802 (high methionine), U.S. Pat. No. 5,885,801 (high threonine), U.S. Pat. No. 6,664,445 (plant amino acid biosynthetic enzymes), U.S. Pat. No. 6,459,019 (increased lysine and threonine), U.S. Pat. No. 6,441,274 (plant tryptophan synthase beta subunit), U.S. Pat. No. 6,346,403 (methionine metabolic enzymes), U.S. Pat. No. 5,939,599 (high sulfur), U.S. Pat. No. 5,912,414 (increased methionine), WO 1998/56935 (plant amino acid biosynthetic enzymes), WO 1998/45458 (engineered seed protein having higher percentage of essential amino acids), WO 1998/42831 (increased lysine), U.S. Pat. No. 5,633,436 (increasing sulfur amino acid content), U.S. Pat. No. 5,559,223 (synthetic storage proteins with defined structure containing programmable levels of essential amino acids for improvement of the nutritional value of plants), WO 1996/01905 (increased threonine), WO 1995/15392 (increased lysine), US Patent Application Publication Number 2003/0163838, US Patent Application Publication Number 2003/0150014, US Patent Application Publication Number 2004/0068767, U.S. Pat. No. 6,803,498, WO 2001/79516.

4. Genes that Control Male-Sterility:

There are several methods of conferring genetic male sterility available, such as multiple mutant genes at separate locations within the genome that confer male sterility, as disclosed in U.S. Pat. Nos. 4,654,465 and 4,727,219 to Brar, et al., and chromosomal translocations as described by Patterson in U.S. Pat. Nos. 3,861,709 and 3,710,511. In addition to these methods, Albertsen, et al., U.S. Pat. No. 5,432,068, describe a system of nuclear male sterility which includes: identifying a gene which is critical to male fertility; silencing this native gene which is critical to male fertility; removing the native promoter from the essential male fertility gene and replacing it with an inducible promoter; inserting this genetically engineered gene back into the plant; and thus creating a plant that is male sterile because the inducible promoter is not “on” resulting in the male fertility gene not being transcribed. Fertility is restored by inducing or turning “on”, the promoter, which in turn allows the gene that confers male fertility to be transcribed.

(A) Introduction of a deacetylase gene under the control of a tapetum-specific promoter and with the application of the chemical N-Ac-PPT (WO 2001/29237).

(B) Introduction of various stamen-specific promoters (WO 1992/13956, WO 1992/13957).

(C) Introduction of the barnase and the barstar gene (Paul, et al., (1992) Plant Mol. Biol. 19:611-622).

For additional examples of nuclear male and female sterility systems and genes, see also, U.S. Pat. Nos. 5,859,341; 6,297,426; 5,478,369; 5,824,524; 5,850,014 and 6,265,640, all of which are hereby incorporated by reference.

5. Genes that Create a Site for Site Specific DNA Integration.

This includes the introduction of FRT sites that may be used in the FLP/FRT system and/or Lox sites that may be used in the Cre/Loxp system. For example, see, Lyznik, et al., (2003) Plant Cell Rep 21:925-932 and WO 1999/25821, which are hereby incorporated by reference. Other systems that may be used include the Gin recombinase of phage Mu (Maeser, et al., (1991) Vicki Chandler, The Maize Handbook ch. 118 (Springer-Verlag 1994), the Pin recombinase of E. coli (Enomoto, et al., 1983) and the R/RS system of the pSRi plasmid (Araki, et al., 1992).

6. Genes that Affect Abiotic Stress Resistance

Including but not limited to flowering, ear and seed development, enhancement of nitrogen utilization efficiency, altered nitrogen responsiveness, drought resistance or tolerance, cold resistance or tolerance and salt resistance or tolerance and increased yield under stress.

(A) For example, see: WO 2000/73475 where water use efficiency is altered through alteration of malate; U.S. Pat. Nos. 5,892,009, 5,965,705, 5,929,305, 5,891,859, 6,417,428, 6,664,446, 6,706,866, 6,717,034, 6,801,104, WO 2000/060089, WO 2001/026459, WO 2001/035725, WO 2001/034726, WO 2001/035727, WO 2001/036444, WO 2001/036597, WO 2001/036598, WO 2002/015675, WO 2002/017430, WO 2002/077185, WO 2002/079403, WO 2003/013227, WO 2003/013228, WO 2003/014327, WO 2004/031349, WO 2004/076638, WO 199809521.

(B) WO 199938977 describing genes, including CBF genes and transcription factors effective in mitigating the negative effects of freezing, high salinity and drought on plants, as well as conferring other positive effects on plant phenotype.

(C) US Patent Application Publication Number 2004/0148654 and WO 2001/36596 where abscisic acid is altered in plants resulting in improved plant phenotype such as increased yield and/or increased tolerance to abiotic stress.

(D) WO 2000/006341, WO 2004/090143, U.S. Pat. Nos. 7,531,723 and 6,992,237 where cytokinin expression is modified resulting in plants with increased stress tolerance, such as drought tolerance, and/or increased yield. Also see, WO 2002/02776, WO 2003/052063, JP 2002/281975, U.S. Pat. No. 6,084,153, WO 2001/64898, U.S. Pat. Nos. 6,177,275 and 6,107,547 (enhancement of nitrogen utilization and altered nitrogen responsiveness).

(E) For ethylene alteration, see, US Patent Application Publication Number 2004/0128719, US Patent Application Publication Number 2003/0166197 and WO 2000/32761.

(F) For plant transcription factors or transcriptional regulators of abiotic stress, see, e.g., US Patent Application Publication Number 2004/0098764 or US Patent Application Publication Number 2004/0078852.

(G) Genes that increase expression of vacuolar pyrophosphatase such as AVP1 (U.S. Pat. No. 8,058,515) for increased yield; nucleic acid encoding a HSFA4 or a HSFA5 (Heat Shock Factor of the class A4 or A5) polypeptides, an oligopeptide transporter protein (OPT4-like) polypeptide; a plastochron2-like (PLA2-like) polypeptide or a Wuschel related homeobox 1-like (WOX1-like) polypeptide (U.S. Patent Application Publication Number US 2011/0283420).

(H) Down regulation of polynucleotides encoding poly (ADP-ribose) polymerase (PARP) proteins to modulate programmed cell death (U.S. Pat. No. 8,058,510) for increased vigor.

(I) Polynucleotide encoding DTP21 polypeptides for conferring drought resistance (US Patent Application Publication Number US 2011/0277181).

(J) Nucleotide sequences encoding ACC Synthase 3 (ACS3) proteins for modulating development, modulating response to stress, and modulating stress tolerance (US Patent Application Publication Number US 2010/0287669).

(K) Polynucleotides that encode proteins that confer a drought tolerance phenotype (DTP) for conferring drought resistance (WO 2012/058528).

(L) Tocopherol cyclase (TC) genes for conferring drought and salt tolerance (US Patent Application Publication Number 2012/0272352).

(M) CAAX amino terminal family proteins for stress tolerance (U.S. Pat. No. 8,338,661).

(N) Mutations in the SAL1 encoding gene have increased stress tolerance, including increased drought resistant (US Patent Application Publication Number 2010/0257633).

(O) Expression of a nucleic acid sequence encoding a polypeptide selected from the group consisting of: GRF polypeptide, RAA1-like polypeptide, SYR polypeptide, ARKL polypeptide, and YTP polypeptide increasing yield-related traits (US Patent Application Publication Number 2011/0061133).

(P) Modulating expression in a plant of a nucleic acid encoding a Class III Trehalose Phosphate Phosphatase (TPP) polypeptide for enhancing yield-related traits in plants, particularly increasing seed yield (US Patent Application Publication Number 2010/0024067).

Other genes and transcription factors that affect plant growth and agronomic traits such as yield, flowering, plant growth and/or plant structure, can be introduced or introgressed into plants, see e.g., WO 1997/49811 (LHY), WO 1998/56918 (ESD4), WO 1997/10339 and U.S. Pat. No. 6,573,430 (TFL), U.S. Pat. No. 6,713,663 (FT), WO 1996/14414 (CON), WO 1996/38560, WO 2001/21822 (VRN1), WO 2000/44918 (VRN2), WO 1999/49064 (GI), WO 2000/46358 (FR1), WO 1997/29123, U.S. Pat. Nos. 6,794,560, 6,307,126 (GAI), WO 1999/09174 (D8 and Rht) and WO 2004/076638 and WO 2004/031349 (transcription factors).

7. Genes that Confer Increased Yield

(A) A transgenic crop plant transformed by a 1-AminoCyclopropane-1-Carboxylate Deaminase-like Polypeptide (ACCDP) coding nucleic acid, wherein expression of the nucleic acid sequence in the crop plant results in the plant's increased root growth, and/or increased yield, and/or increased tolerance to environmental stress as compared to a wild type variety of the plant (U.S. Pat. No. 8,097,769).

(B) Over-expression of maize zinc finger protein gene (Zm-ZFP1) using a seed preferred promoter has been shown to enhance plant growth, increase kernel number and total kernel weight per plant (US Patent Application Publication Number 2012/0079623).

(C) Constitutive over-expression of maize lateral organ boundaries (LOB) domain protein (Zm-LOBDP1) has been shown to increase kernel number and total kernel weight per plant (US Patent Application Publication Number 2012/0079622).

(D) Enhancing yield-related traits in plants by modulating expression in a plant of a nucleic acid encoding a VIM1 (Variant in Methylation 1)-like polypeptide or a VTC2-like (GDP-L-galactose phosphorylase) polypeptide or a DUF1685 polypeptide or an ARF6-like (Auxin Responsive Factor) polypeptide (WO 2012/038893).

(E) Modulating expression in a plant of a nucleic acid encoding a Ste20-like polypeptide or a homologue thereof gives plants having increased yield relative to control plants (EP 2431472).

(F) Genes encoding nucleoside diphosphatase kinase (NDK) polypeptides and homologs thereof for modifying the plant's root architecture (US Patent Application Publication Number 2009/0064373).

8. Genes that Confer Plant Digestibility

(A) Altering the level of xylan present in the cell wall of a plant by modulating expression of xylan synthase (U.S. Pat. No. 8,173,866).

In some embodiment the stacked trait may be a trait or event that has received regulatory approval including but not limited to the events in Table 2A-1F.

TABLE 2A Triticum aestivum Wheat Event Company Description AP205CL BASF Inc. Selection for a mutagenized version of the enzyme acetohydroxyacid synthase (AHAS), also known as acetolactate synthase (ALS) or acetolactate pyruvate-lyase. AP602CL BASF Inc. Selection for a mutagenized version of the enzyme acetohydroxyacid synthase (AHAS), also known as acetolactate synthase (ALS) or acetolactate pyruvate-lyase. BW255-2, BASF Inc. Selection for a mutagenized version of the BW238-3 enzyme acetohydroxyacid synthase (AHAS), also known as acetolactate synthase (ALS) or acetolactate pyruvate- lyase. BW7 BASF Inc. Tolerance to imidazolinone herbicides induced by chemical mutagenesis of the acetohydroxyacid synthase (AHAS) gene using sodium azide. MON71800 Monsanto Glyphosate tolerant wheat variety Company produced by inserting a modified 5-enolpyruvylshikimate- 3-phosphate synthase (EPSPS) encoding gene from the soil bacterium Agrobacterium tumefaciens, strain CP4. SWP965001 Cyanamid Selection for a mutagenized version of the Crop enzyme acetohydroxyacid synthase Protection (AHAS), also known as acetolactate synthase (ALS) or acetolactate pyruvate-lyase. Teal 11A BASF Inc. Selection for a mutagenized version of the enzyme acetohydroxyacid synthase (AHAS), also known as acetolactate synthase (ALS) or acetolactate pyruvate-lyase. A2704-12, Bayer Glufosinate ammonium herbicide tolerant A2704-21, CropScience soybean produced by inserting a modified A5547-35 (Aventis phosphinothricin acetyltransferase (PAT) CropScience encoding gene from the soil bacterium (AgrEvo)) Streptomyces viridochromogenes.

TABLE 2B Glycine max L. Soybean Event Compay Description A5547-127 Bayer Glufosinate ammonium herbicide tolerant CropScience soybean produced by inserting a modified (Aventis phosphinothricin acetyltransferase (PAT) CropScience encoding gene from the soil bacterium (AgrEvo)) Streptomyces viridochromogenes. BPS- BASF Inc. The introduced csr1-2 gene from CV127-9 Arabidopsis thaliana encodes an acetohydroxyacid synthase protein that confers tolerance to imidazolinone herbicides due to a point mutation that results in a single amino acid substitution in which the serine residue at position 653 is replaced by asparagine (S653N). DP-305423 Pioneer High oleic acid soybean produced by Hi-Bred inserting additional copies of a portion of International the omega-6 desaturase encoding gene, Inc. gm-fad2-1 resulting in silencing of the endogenous omega-6 desaturase gene (FAD2-1). DP356043 Pioneer Soybean event with two herbicide tolerance Hi-Bred genes: glyphosate N-acetlytransferase, which International detoxifies glyphosate, and a modified Inc. acetolactate synthase (ALS) gene which is tolerant to ALS-inhibiting herbicides. G94-1, DuPont High oleic acid soybean produced by G94-19, Canada inserting a second copy of the fatty acid G168 Agricultural desaturase (GmFad2-1) encoding gene Products from soybean, which resulted in “silencing” of the endogenous host gene. GTS 40-3-2 Monsanto Glyphosate tolerant soybean variety Company produced by inserting a modified 5-enolpyruvylshikimate-3- phosphate synthase (EPSPS) encoding gene from the soil bacterium Agrobacterium tumefaciens. GU262 Bayer Glufosinate ammonium herbicide tolerant CropScience soybean produced by inserting a modified (Aventis phosphinothricin acetyltransferase (PAT) CropScience encoding gene from the soil bacterium (AgrEvo)) Streptomyces viridochromogenes. MON87701 Monsanto Resistance to lepidopteran pests of soybean Company including velvetbean caterpillar (Anticarsia gemmatalis) and soybean looper (Pseudoplusia includens). MON87701 x Monsanto Glyphosate herbicide tolerance through MON89788 Company expression of the EPSPS encoding gene from A. tumefaciens strain CP4, and resistance to lepidopteran pests of soybean including velvetbean caterpillar (Anticarsia gemmatalis) and soybean looper (Pseudoplusia includens) via expression of the Cry1Ac encoding gene from B. thuringiensis. MON89788 Monsanto Glyphosate-tolerant soybean produced by Company inserting a modified 5-enolpyruvylshikimate- 3-phosphate synthase (EPSPS) encoding aroA (epsps) gene from Agrobacterium tumefaciens CP4. OT96-15 Agriculture & Low linolenic acid soybean produced Agri-Food through traditional cross-breeding to Canada incorporate the novel trait from a naturally occurring fan1 gene mutant that was selected for low linolenic acid. W62, W98 Bayer Glufosinate ammonium herbicide tolerant CropScience soybean produced by inserting a modified (Aventis phosphinothricin acetyltransferase (PAT) CropScience encoding gene from the soil bacterium (AgrEvo)) Streptomyces hygroscopicus.

TABLE 2C Helianthus annuus Sunflower Event Company Description X81359 BASF Inc. Tolerance to imidazolinone herbicides by selection of a naturally occurring mutant.

TABLE 2D Medicago sativa Alfalfa Event Company Description J101, J163 Monsanto Glyphosate herbicide tolerant alfalfa (lucerne) Company produced by inserting a gene encoding the and Forage enzyme 5-enolypyruvylshikimate-3-phosphate Genetics synthase (EPSPS) from the CP4 strain of International Agrobacterium tumefaciens.

TABLE 2E Oryza sativa Rice Event Company Description CL121, BASF Inc. Tolerance to the imidazolinone herbicide, CL141, imazethapyr, induced by chemical CFX51 mutagenesis of the acetolactate synthase (ALS) enzyme using ethyl methanesulfonate (EMS). IMINTA-1, BASF Inc. Tolerance to imidazolinone herbicides IMINTA-4 induced by chemical mutagenesis of the acetolactate synthase (ALS) enzyme using sodium azide. LLRICE06, Aventis Glufosinate ammonium herbicide tolerant LLRICE62 CropScience rice produced by inserting a modified phosphinothricin acetyltransferase (PAT) encoding gene from the soil bacterium Streptomyces hygroscopicus). LLRICE601 Bayer Glufosinate ammonium herbicide tolerant CropScience rice produced by inserting a modified (Aventis phosphinothricin acetyltransferase (PAT) CropScience encoding gene from the soil bacterium (AgrEvo)) Streptomyces hygroscopicus). PWC16 BASF Inc. Tolerance to the imidazolinone herbicide, imazethapyr, induced by chemical mutagenesis of the acetolactate synthase (ALS) enzyme using ethyl methanesulfonate (EMS).

TABLE 2F Zea mays L. Maize Event Company Description 176 Syngenta Insect-resistant maize produced by inserting the Seeds, cry1Ab gene from Bacillus thuringiensis subsp. Inc. kurstaki. The genetic modification affords resistance to attack by the European corn borer (ECB). 3751IR Pioneer Selection of somaclonal variants by culture of Hi-Bred embryos on imidazolinone containing media. Inter- national Inc. 676, 678, Pioneer Male-sterile and glufosinate ammonium 680 Hi-Bred herbicide tolerant maize produced by inserting Inter- genes encoding DNA adenine methylase and national phosphinothricin acetyltransferase (PAT) Inc. from Escherichia coli and Streptomyces viridochromogenes, respectively. B16 Dekalb Glufosinate ammonium herbicide tolerant (DLL25) Genetics maize produced by inserting the gene Cor- encoding phosphinothricin acetyltransferase poration (PAT) from Streptomyces hygroscopicus. BT11 Syngenta Insect-resistant and herbicide tolerant maize (X4334CBR, Seeds, produced by inserting the cry1Ab gene from X4734CBR) Inc. Bacillus thuringiensis subsp. kurstaki, and the phosphinothricin N-acetyltransferase (PAT) encoding gene from S. viridochromogenes. BT11 x Syngenta Stacked insect resistant and herbicide tolerant GA21 Seeds, maize produced by conventional cross breeding Inc. of parental lines BT11 (OECD unique identifier: SYN-BTO11-1) and GA21 (OECD unique identifier: MON-OOO21-9). BT11 x Syngenta Stacked insect resistant and herbicide tolerant MIR162 Seeds, maize produced by conventional cross breeding Inc. of parental lines BT11 (OECD unique identifier: SYN-BTO11-1) and MIR162 (OECD unique identifier: SYN-IR162-4). Resistance to the European Corn Borer and tolerance to the herbicide glufosinate ammonium (Liberty) is derived from BT11, which contains the cry1Ab gene from Bacillus thuringiensis subsp. kurstaki, and the phosphinothricin N- acetyltransferase (PAT) encoding gene from S. viridochromogenes. Resistance to other lepidopteran pests, including H. zea, S. frugiperda, A. ipsilon, and S. albicosta, is derived from MIR162, which contains the vip3Aa gene from Bacillus thuringiensis strain AB88. BT11 x Syngenta Bacillus thuringiensis Cry1Ab delta-endotoxin MIR162 x Seeds, protein and the genetic material necessary for its MIR604 Inc. production (via elements of vector pZO1502) in Event Bt11 corn (OECD Unique Identifier: SYN-BTO11-1) x Bacillus thuringiensis Vip3Aa20 insecticidal protein and the genetic material necessary for its production (via elements of vector pNOV1300) in Event MIR162 maize (OECD Unique Identifier: SYN-IR162-4) x modified Cry3A protein and the genetic material necessary for its production (via elements of vector pZM26) in Event MIR604 corn (OECD Unique Identifier: SYN-IR6O4-5). BT11 x Syngenta Resistance to coleopteran pests, particularly MIR162 x Seeds, corn rootworm pests (Diabrotica spp.) and MIR604 x Inc. several lepidopteran pests of corn, including GA21 European corn borer (ECB, Ostrinia nubilalis), corn earworm (CEW, Helicoverpa zea), fall army worm (FAW, Spodoptera frugiperda), and black cutworm (BCW, Agrotis ipsilon); tolerance to glyphosate and glufosinate- ammonium containing herbicides. BT11 x Syngenta Stacked insect resistant and herbicide tolerant MIR604 Seeds, maize produced by conventional cross breeding Inc. of parental lines BT11 (OECD unique identifier: SYN-BTO11-1) and MIR604 (OECD unique identifier: SYN-IR605-5). Resistance to the European Corn Borer and tolerance to the herbicide glufosinate ammonium (Liberty) is derived from BT11, which contains the cry1Ab gene from Bacillus thuringiensis subsp. kurstaki, and the phosphinothricin N-acetyltransferase (PAT) encoding gene from S. viridochromogenes. Corn rootworm-resistance is derived from MIR604 which contains the mcry3A gene from Bacillus thuringiensis. BT11 x Syngenta Stacked insect resistant and herbicide tolerant MIR604 x Seeds, maize produced by conventional cross GA21 Inc. breeding of parental lines BT11 (OECD unique identifier: SYN-BTO11-1), MIR604 (OECD unique identifier: SYN-IR605-5) and GA21 (OECD unique identifier: MON-OOO21-9). Resistance to the European Corn Borer and tolerance to the herbicide glufosinate ammonium (Liberty) is derived from BT11, which contains the cry1Ab gene from Bacillus thuringiensis subsp. kurstaki, and the phosphinothricin N- acetyltransferase (PAT) encoding gene from S. viridochromogenes. Corn rootworm-resistance is derived from MIR604 which contains the mcry3A gene from Bacillus thuringiensis. Tolerance to glyphosate herbicide is derived from GA21 which contains a modified EPSPS gene from maize. CBH-351 Aventis Insect-resistant and glufosinate ammonium Crop- herbicide tolerant maize developed by inserting Science genes encoding Cry9C protein from Bacillus thuringiensis subsp tolworthi and phosphinothricin acetyltransferase (PAT) from Streptomyces hygroscopicus. DAS- DOW Lepidopteran insect resistant and glufosinate 06275-8 Agro- ammonium herbicide-tolerant maize variety Sciences produced by inserting the cry1F gene from LLC Bacillus thuringiensis var aizawai and the phosphinothricin acetyltransferase (PAT) from Streptomyces hygroscopicus. DAS- DOW Corn rootworm-resistant maize produced by 59122-7 Agro- inserting the cry34Ab1 and cry35Ab1 genes Sciences from Bacillus thuringiensis strain P5149B1. LLC and The PAT encoding gene from Streptomyces Pioneer viridochromogenes was introduced as a Hi-Bred selectable marker. Inter- national Inc. DAS- DOW Stacked insect resistant and herbicide tolerant 59122-7 x Agro- maize produced by conventional cross breeding NK603 Sciences of parental lines DAS-59122-7 (OECD unique LLC and identifier: DAS-59122-7) with NK603 (OECD Pioneer unique identifier: MON-OO6O3-6). Corn Hi-Bred rootworm-resistance is derived from DAS- Inter- 59122-7 which contains the cry34Ab1 and national cry35Ab1 genes from Bacillus thuringiensis Inc. strain PS149B1. Tolerance to glyphosate herbicide is derived from NK603. DAS- DOW Stacked insect resistant and herbicide tolerant 59122-7 x Agro- maize produced by conventional cross breeding TC1507 x Sciences of parental lines DAS-59122-7 (OECD unique NK603 LLC and identifier: DAS-59122-7) and TC1507 (OECD Pioneer unique identifier: DAS-O15O7-1) with NK603 Hi-Bred (OECD unique identifier: MON-OO6O3-6). Inter- Corn rootworm-resistance is derived from national DAS-59122-7 which contains the cry34Ab1 Inc. and cry35Ab1 genes from Bacillus thuringiensis strain PS149B1. Lepidopteran resistance and tolerance to glufosinate ammonium herbicide is derived from TC1507. Tolerance to glyphosate herbicide is derived from NK603. DBT418 Dekalb Insect-resistant and glufosinate ammonium Genetics herbicide tolerant maize developed by inserting Cor- genes encoding Cry1AC protein from poration Bacillus thuringiensis subsp kurstaki and phosphinothricin acetyltransferase (PAT) from Streptomyces hygroscopicus DK404SR BASF Somaclonal variants with a modified Inc. acetyl-CoA-carboxylase (ACCase) were selected by culture of embryos on sethoxydim enriched medium. Event Syngenta Maize line expressing a heat stable alpha- 3272 Seeds, amylase gene amy797E for use in the dry-grind Inc. ethanol process. The phosphomannose isomerase gene from E. coli was used as a selectable marker. Event Pioneer Maize event expressing tolerance to glyphosate 98140 Hi-Bred herbicide, via expression of a modified Inter- bacterial glyphosate N-acetlytransferase, and national ALS-inhibiting herbicides, vial expression of Inc. a modified form of the maize acetolactate synthase enzyme. EXP1910IT Syngenta Tolerance to the imidazolinone herbicide, Seeds, imazethapyr, induced by chemical mutagenesis Inc. of the acetolactate synthase (ALS) enzyme (formerly using ethyl methanesulfonate (EMS). Zeneca Seeds) GA21 Syngenta Introduction, by particle bombardment, of a Seeds, modified 5-enolpyruvyl shikimate-3-phosphate Inc. synthase (EPSPS), an enzyme involved in (formerly the shikimate biochemical pathway for the Zeneca production of the aromatic amino acids. Seeds) GA21 x Monsanto Stacked insect resistant and herbicide tolerant MON810 Company corn hybrid derived from conventional cross- breeding of the parental lines GA21 (OECD identifier: MONOOO21-9) and MON810 (OECD identifier: MON-OO81O-6). IT Pioneer Tolerance to the imidazolinone herbicide, Hi-Bred imazethapyr, was obtained by in vitro Inter- selection of somaclonal variants. national Inc. LY038 Monsanto Altered amino acid composition, specifically Company elevated levels of lysine, through the introduction of the cordapA gene, derived from Corynebacterium glutamicum, encoding the enzyme dihydrodipicolinate synthase (cDHDPS). MIR162 Syngenta Insect-resistant maize event expressing a Seeds, Vip3A protein from Bacillus thuringiensis Inc. and the Escherichia coli PMI selectable marker MIR604 Syngenta Corn rootworm resistant maize produced Seeds, Inc. by transformation with a modified cry3A gene. The phosphomannose isomerase gene from E. coli was used as a selectable marker. MIR604 x Syngenta Stacked insect resistant and herbicide tolerant GA21 Seeds, maize produced by conventional cross Inc. breeding of parental lines MIR604 (OECD unique identifier: SYN-IR6O5-5) and GA21 (OECD unique identifier: MON-OOO21-9). Corn rootworm-resistance is derived from MIR604 which contains the mcry3A gene from Bacillus thuringiensis. Tolerance to glyphosate herbicide is derived from GA21. MON80100 Monsanto Insect-resistant maize produced by inserting Company the cry1Ab gene from Bacillus thuringiensis subsp. kurstaki. The genetic modification affords resistance to attack by the European corn borer (ECB). MPN802 Monsanto Insect-resistant and glyphosate herbicide Company tolerant maize produced by inserting the genes encoding the Cry1Ab protein from Bacillus thuringiensis and the 5- enolpyruvylshikimate-3-phosphate synthase (EPSPS) from A. tumefaciens strain CP4. MON809 Pioneer Resistance to European corn borer (Ostrinia Hi- nubilalis) by introduction of a synthetic Bred cry1Ab gene. Glyphosate resistance via Inter- introduction of the bacterial version of a national plant enzyme, 5-enolpyruvyl Inc. shikimate-3-phosphate synthase (EPSPS). MON810 Monsanto Insect-resistant maize produced by inserting Company a truncated form of the cry1Ab gene from Bacillus thuringiensis subsp. kurstaki HD-1. The genetic modification affords resistance to attack by the European corn borer (ECB). MON810 x Monsanto Stacked insect resistant and enhanced lysine LY038 Company content maize derived from conventional cross-breeding of the parental lines MON810 (OECD identifier: MON-OO81O-6) and LY038 (OECD identifier: REN-OOO38-3). MON810 x Monsanto Stacked insect resistant and glyphosate MON88017 Company tolerant maize derived from conventional cross-breeding of the parental lines MON810 (OECD identifier: MON-OO81O-6) and MON88017 (OECD identifier: MON-88O17-3). European corn borer (ECB) resistance is derived from a truncated form of the cry1Ab gene from Bacillus thuringiensis subsp. kurstaki HD-1 present in MON810. Corn rootworm resistance is derived from the cry3Bb1 gene from Bacillus thuringiensis subspecies kumamotoensis strain EG4691 present in MON88017. Glyphosate tolerance is derived from a 5-enolpyruvylshikimate-3- phosphate synthase (EPSPS) encoding gene from Agrobacterium tumefaciens strain CP4 present in MON88017. MON832 Monsanto Introduction, by particle bombardment, of Company glyphosate oxidase (GOX) and a modified 5-enolpyruvyl shikimate-3-phosphate synthase (EPSPS), an enzyme involved in the shikimate biochemical pathway for the production of the aromatic amino acids. MON863 Monsanto Corn root worm resistant maize produced by Company inserting the cry3Bb1 gene from Bacillus thuringiensis subsp. kumamotoensis. MON863 x Monsanto Stacked insect resistant corn hybrid derived MON810 Company from conventional cross-breeding of the parental lines MON863 (OECD identifier: MON-OO863-5) and MON810 (OECD identifier: MON-OO81O-6) MON863 x Monsanto Stacked insect resistant and herbicide MON810 x Company tolerant corn hybrid derived from conventional NK603 cross-breeding of the stacked hybrid MON- OO863-5 x MON-OO81O-6 and NK603 (OECD identifier: MON-OO6O3-6). MON863 x Monsanto Stacked insect resistant and herbicide tolerant NK603 Company corn hybrid derived from conventional cross- breeding of the parental lines MON863 (OECD identifier:MON-OO863-5) and NK603 (OECD identifier: MON-OO6O3-6). MON87460 Monsanto MON 87460 was developed to provide Company reduced yield loss under water-limited conditions compared to conventional maize. Efficacy in MON 87460 is derived by expression of the inserted Bacillus subtilis cold shock protein B (CspB). MON88017 Monsanto Corn rootworm-resistant maize produced by Company inserting the cry3Bb1 gene from Bacillus thuringiensis subspecies kumamotoensis strain EG4691. Glyphosate tolerance derived by inserting a 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) encoding gene from Agrobacterium tumefaciens strain CP4. MON89034 Monsanto Maize event expressing two different Company insecticidal proteins from Bacillus thuringiensis providing resistance to number of lepidopteran pests. MON89034 x Monsanto Stacked insect resistant and glyphosate tolerant MON88017 Company maize derived from conventional cross-breeding of the parental lines MON89034 (OECD identifier: MON-89O34-3) and MON88017 (OECD identifier: MON-88O17-3). Resistance to Lepidopteran insects is derived from two cry genes present in MON89043. Corn rootworm resistance is derived from a single cry genes and glyphosate tolerance is derived from the 5- enolpyruvylshikimate-3-phosphate synthase (EPSPS) encoding gene from Agrobacterium tumefaciens present in MON88017. M0N89034 x Monsanto Stacked insect resistant and herbicide tolerant NK603 Company maize produced by conventional cross breeding of parental lines MON89034 (OECD identifier: MON-89O34-3) with NK603 (OECD unique identifier: MON-OO603-6). Resistance to Lepidopteran insects is derived from two cry genes present in MON89043. Tolerance to glyphosate herbicide is derived from NK603. MON89034 x Monsanto Stacked insect resistant and herbicide tolerant TC1507 x Company maize produced by conventional cross breeding MON88017 x and of parental lines: MON89034, TC1507, DAS- Mycogen MON88017, and DAS-59122. Resistance to the 59122-7 Seeds above-ground and below-ground insect pests c/o and tolerance to glyphosate and glufosinate- Dow ammonium containing herbicides. Agro- Sciences LLC MS3 Bayer Male sterility caused by expression of the Crop- barnase ribonuclease gene from Bacillus Science amyloliquefaciens; PPT resistance was via (Aventis PPT-acetyltransferase (PAT). Crop- Science (AgrEvo)) MS6 Bayer Male sterility caused by expression of the Crop- barnase ribonuclease gene from Bacillus Science amyloliquefaciens; PPT resistance was via (Aventis PPT-acetyltransferase (PAT). Crop Science (AgrEvo)) NK603 Monsanto Introduction, by particle bombardment, of a Company modified 5-enolpyruvyl shikimate-3-phosphate synthase (EPSPS), an enzyme involved in the shikimate biochemical pathway for the production of the aromatic amino acids. NK603 x Monsanto Stacked insect resistant and herbicide tolerant MON810 Company corn hybrid derived from conventional cross- breeding of the parental lines NK603 (OECD identifier: MON-OO6O3-6) and MON810 (OECD identifier: MON-OO81O-6). NK603 x Monsanto Stacked glufosinate ammonium and T25 Company glyphosate herbicide tolerant maize hybrid derived from conventional cross-breeding of the parental lines NK603 (OECD identifier: MON-OO6O3-6) and T25 (OECD identifier: ACS-ZM003-2). T14, T25 Bayer Glufosinate herbicide tolerant maize Crop- produced by inserting the phosphinothricin Science N-acetyltransferase (Aventis (PAT) encoding gene from the aerobic Crop- actinomycete Streptomyces Science viridochromogenes. (AgrEvo)) T25 x Bayer Stacked insect resistant and herbicide tolerant MON810 Crop- corn hybrid derived from conventional cross- Science breeding of the parental lines T25 (OECD (Aventis identifier: ACS-ZMOO3-2) and MON810 Crop- (OECD identifier: MONOO81O-6). Science (AgrEvo)) TC1507 Mycogen Insect-resistant and glufosinate ammonium (c/o Dow herbicide tolerant maize produced by inserting Agro- the cry1F gene from Bacillus thuringiensis var. Sciences); aizawai and the phosphinothricin N- Pioneer acetyltransferase encoding gene from (c/o Streptomyces viridochromogenes. DuPont) TC1507 x DOW Stacked insect resistant and herbicide tolerant DAS- Agro- maize produced by conventional cross 59122-7 Sciences breeding of parental lines TC1507 (OECD LLC and unique identifier: DAS-O15O7-1) with Pioneer DAS-59122-7 (OECD unique identifier: Hi-Bred DAS-59122-7). Resistance to lepidopteran Inter- insects is derived from TC1507 due national the presence of the cry1F gene from Bacillus Inc. thuringiensis var. aizawai. Corn rootworm- resistance is derived from DAS-59122-7 which contains the cry34Ab1 and cry35Ab1 genes from Bacillus thuringiensis strain PS149B1. Tolerance to glufosinate ammonium herbicide is derived from TC1507 from the phosphinothricin N- acetyltransferase encoding gene from Streptomyces viridochromogenes. TC1507 x DOW Stacked insect resistant and herbicide tolerant NK603 Agro- corn hybrid derived from conventional cross- Sciences breeding of the parental lines 1507 (OECD LLC identifier: DAS-O15O7-1) and NK603 (OECD identifier: MON-OO6O3-6).

Other events with regulatory approval are well known to one skilled in the art and can be found at the Center for Environmental Risk Assessment (cera-gmc.org/?action=gm_crop_database, which can be accessed using the www prefix).

Gene Silencing

In some embodiments the stacked trait may be in the form of silencing of one or more polynucleotides of interest resulting in suppression of one or more target pest polypeptides. In some embodiments the silencing is achieved through the use of a suppression DNA construct.

In some embodiments one or more of the PHI-4 polypeptides or fragments or variants thereof may be stacked with one or more polynucleotides encoding one or more polypeptides having insecticidal activity or agronomic traits as set forth supra and optionally may further include one or more polynucleotides providing for gene silencing of one or more target polynucleotides as discussed infra.

“Suppression DNA construct” is a recombinant DNA construct which when transformed or stably integrated into the genome of the plant, results in “silencing” of a target gene in the plant. The target gene may be endogenous or transgenic to the plant. “Silencing,” as used herein with respect to the target gene, refers generally to the suppression of levels of mRNA or protein/enzyme expressed by the target gene, and/or the level of the enzyme activity or protein functionality. The term “suppression” includes lower, reduce, decline, decrease, inhibit, eliminate and prevent. “Silencing” or “gene silencing” does not specify mechanism and is inclusive, and not limited to, anti-sense, cosuppression, viral-suppression, hairpin suppression, stem-loop suppression, RNAi-based approaches and small RNA-based approaches.

A suppression DNA construct may comprise a region derived from a target gene of interest and may comprise all or part of the nucleic acid sequence of the sense strand (or antisense strand) of the target gene of interest. Depending upon the approach to be utilized, the region may be 100% identical or less than 100% identical (e.g., at least 50% or any integer between 51% and 100% identical) to all or part of the sense strand (or antisense strand) of the gene of interest.

Suppression DNA constructs are well-known in the art, are readily constructed once the target gene of interest is selected, and include, without limitation, cosuppression constructs, antisense constructs, viral-suppression constructs, hairpin suppression constructs, stem-loop suppression constructs, double-stranded RNA-producing constructs, and more generally, RNAi (RNA interference) constructs and small RNA constructs such as siRNA (short interfering RNA) constructs and miRNA (microRNA) constructs.

“Antisense inhibition” refers to the production of antisense RNA transcripts capable of suppressing the expression of the target protein.

“Antisense RNA” refers to an RNA transcript that is complementary to all or part of a target primary transcript or mRNA and that blocks the expression of a target isolated nucleic acid fragment (U.S. Pat. No. 5,107,065). The complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e., at the 5′ non-coding sequence, 3′ non-coding sequence, introns or the coding sequence.

“Cosuppression” refers to the production of sense RNA transcripts capable of suppressing the expression of the target protein. “Sense” RNA refers to RNA transcript that includes the mRNA and can be translated into protein within a cell or in vitro. Cosuppression constructs in plants have been previously designed by focusing on overexpression of a nucleic acid sequence having homology to a native mRNA, in the sense orientation, which results in the reduction of all RNA having homology to the overexpressed sequence (see, Vaucheret, et al. (1998) Plant J. 16:651-659 and Gura, (2000) Nature 404:804-808).

Another variation describes the use of plant viral sequences to direct the suppression of proximal mRNA encoding sequences (PCT Publication WO 1998/36083).

Recent work has described the use of “hairpin” structures that incorporate all, or part, of an mRNA encoding sequence in a complementary orientation that results in a potential “stem-loop” structure for the expressed RNA (PCT Publication Number WO 1999/53050). In this case the stem is formed by polynucleotides corresponding to the gene of interest inserted in either sense or anti-sense orientation with respect to the promoter and the loop is formed by some polynucleotides of the gene of interest, which do not have a complement in the construct. This increases the frequency of cosuppression or silencing in the recovered transgenic plants. For review of hairpin suppression see, Wesley, et al., (2003) Methods in Molecular Biology, Plant Functional Genomics: Methods and Protocols 236:273-286.

A construct where the stem is formed by at least 30 nucleotides from a gene to be suppressed and the loop is formed by a random nucleotide sequence has also effectively been used for suppression (WO 1999/61632).

The use of poly-T and poly-A sequences to generate the stem in the stem-loop structure has also been described (WO 2002/00894).

Yet another variation includes using synthetic repeats to promote formation of a stem in the stem-loop structure. Transgenic organisms prepared with such recombinant DNA fragments have been shown to have reduced levels of the protein encoded by the nucleotide fragment forming the loop as described in PCT Publication Number WO 2002/00904.

RNA interference refers to the process of sequence-specific post-transcriptional gene silencing in animals mediated by short interfering RNAs (siRNAs) (Fire, et al., (1998) Nature 391:806). The corresponding process in plants is commonly referred to as post-transcriptional gene silencing (PTGS) or RNA silencing and is also referred to as quelling in fungi. The process of post-transcriptional gene silencing is thought to be an evolutionarily-conserved cellular defense mechanism used to prevent the expression of foreign genes and is commonly shared by diverse flora and phyla (Fire, et al., (1999) Trends Genet. 15:358). Such protection from foreign gene expression may have evolved in response to the production of double-stranded RNAs (dsRNAs) derived from viral infection or from the random integration of transposon elements into a host genome via a cellular response that specifically destroys homologous single-stranded RNA of viral genomic RNA. The presence of dsRNA in cells triggers the RNAi response through a mechanism that has yet to be fully characterized.

The presence of long dsRNAs in cells stimulates the activity of a ribonuclease III enzyme referred to as dicer. Dicer is involved in the processing of the dsRNA into short pieces of dsRNA known as short interfering RNAs (siRNAs) (Berstein, et al., (2001) Nature 409:363). Short interfering RNAs derived from dicer activity are typically about 21 to about 23 nucleotides in length and comprise about 19 base pair duplexes (Elbashir, et al., (2001) Genes Dev. 15:188). Dicer has also been implicated in the excision of 21- and 22-nucleotide small temporal RNAs (stRNAs) from precursor RNA of conserved structure that are implicated in translational control (Hutvagner, et al., (2001) Science 293:834). The RNAi response also features an endonuclease complex, commonly referred to as an RNA-induced silencing complex (RISC), which mediates cleavage of single-stranded RNA having sequence complementarity to the antisense strand of the siRNA duplex. Cleavage of the target RNA takes place in the middle of the region complementary to the antisense strand of the siRNA duplex (Elbashir, et al., (2001) Genes Dev. 15:188). In addition, RNA interference can also involve small RNA (e.g., miRNA) mediated gene silencing, presumably through cellular mechanisms that regulate chromatin structure and thereby prevent transcription of target gene sequences (see, e.g., Allshire, (2002) Science 297:1818-1819; Volpe, et al., (2002) Science 297:1833-1837; Jenuwein, (2002) Science 297:2215-2218; and Hall, et al., (2002) Science 297:2232-2237). As such, miRNA molecules of the invention can be used to mediate gene silencing via interaction with RNA transcripts or alternately by interaction with particular gene sequences, wherein such interaction results in gene silencing either at the transcriptional or post-transcriptional level.

Methods and compositions are further provided which allow for an increase in RNAi produced from the silencing element. In such embodiments, the methods and compositions employ a first polynucleotide comprising a silencing element for a target pest sequence operably linked to a promoter active in the plant cell; and, a second polynucleotide comprising a suppressor enhancer element comprising the target pest sequence or an active variant or fragment thereof operably linked to a promoter active in the plant cell. The combined expression of the silencing element with suppressor enhancer element leads to an increased amplification of the inhibitory RNA produced from the silencing element over that achievable with only the expression of the silencing element alone. In addition to the increased amplification of the specific RNAi species itself, the methods and compositions further allow for the production of a diverse population of RNAi species that can enhance the effectiveness of disrupting target gene expression. As such, when the suppressor enhancer element is expressed in a plant cell in combination with the silencing element, the methods and composition can allow for the systemic production of RNAi throughout the plant; the production of greater amounts of RNAi than would be observed with just the silencing element construct alone; and, the improved loading of RNAi into the phloem of the plant, thus providing better control of phloem feeding insects by an RNAi approach. Thus, the various methods and compositions provide improved methods for the delivery of inhibitory RNA to the target organism. See, for example, US 2009/0188008.

As used herein, a “suppressor enhancer element” comprises a polynucleotide comprising the target sequence to be suppressed or an active fragment or variant thereof. It is recognize that the suppressor enhancer element need not be identical to the target sequence, but rather, the suppressor enhancer element can comprise a variant of the target sequence, so long as the suppressor enhancer element has sufficient sequence identity to the target sequence to allow for an increased level of the RNAi produced by the silencing element over that achievable with only the expression of the silencing element. Similarly, the suppressor enhancer element can comprise a fragment of the target sequence, wherein the fragment is of sufficient length to allow for an increased level of the RNAi produced by the silencing element over that achievable with only the expression of the silencing element.

It is recognized that multiple suppressor enhancer elements from the same target sequence or from different target sequences, or from different regions of the same target sequence can be employed. For example, the suppressor enhancer elements employed can comprise fragments of the target sequence derived from different region of the target sequence (i.e., from the 3′UTR, coding sequence, intron, and/or 5′UTR). Further, the suppressor enhancer element can be contained in an expression cassette, as described elsewhere herein, and in specific embodiments, the suppressor enhancer element is on the same or on a different DNA vector or construct as the silencing element. The suppressor enhancer element can be operably linked to a promoter as disclosed herein. It is recognized that the suppressor enhancer element can be expressed constitutively or alternatively, it may be produced in a stage-specific manner employing the various inducible or tissue-preferred or developmentally regulated promoters that are discussed elsewhere herein.

In specific embodiments, employing both a silencing element and the suppressor enhancer element the systemic production of RNAi occurs throughout the entire plant. In further embodiments, the plant or plant parts of the invention have an improved loading of RNAi into the phloem of the plant than would be observed with the expression of the silencing element construct alone and, thus provide better control of phloem feeding insects by an RNAi approach. In specific embodiments, the plants, plant parts, and plant cells of the invention can further be characterized as allowing for the production of a diversity of RNAi species that can enhance the effectiveness of disrupting target gene expression.

In specific embodiments, the combined expression of the silencing element and the suppressor enhancer element increases the concentration of the inhibitory RNA in the plant cell, plant, plant part, plant tissue or phloem over the level that is achieved when the silencing element is expressed alone.

As used herein, an “increased level of inhibitory RNA” comprises any statistically significant increase in the level of RNAi produced in a plant having the combined expression when compared to an appropriate control plant. For example, an increase in the level of RNAi in the plant, plant part or the plant cell can comprise at least about a 1%, about a 1%-5%, about a 5%-10%, about a 10%-20%, about a 20%-30%, about a 30%-40%, about a 40%-50%, about a 50%-60%, about 60-70%, about 70%-80%, about a 80%-90%, about a 90%-100% or greater increase in the level of RNAi in the plant, plant part, plant cell, or phloem when compared to an appropriate control. In other embodiments, the increase in the level of RNAi in the plant, plant part, plant cell, or phloem can comprise at least about a 1 fold, about a 1 fold-5 fold, about a 5 fold-10 fold, about a 10 fold-20 fold, about a 20 fold-30 fold, about a 30 fold-40 fold, about a 40 fold-50 fold, about a 50 fold-60 fold, about 60 fold-70 fold, about 70 fold-80 fold, about a 80 fold-90 fold, about a 90 fold-100 fold or greater increase in the level of RNAi in the plant, plant part, plant cell or phloem when compared to an appropriate control. Examples of combined expression of the silencing element with suppressor enhancer element for the control of Stinkbugs and Lygus can be found in US 2011/0301223 and US 2009/0192117.

Some embodiments relate to down-regulation of expression of target genes in insect pest species by interfering ribonucleic acid (RNA) molecules. WO 2007/074405 describes methods of inhibiting expression of target genes in invertebrate pests including Colorado potato beetle. WO 2005/110068 describes methods of inhibiting expression of target genes in invertebrate pests including in particular Western corn rootworm as a means to control insect infestation. Furthermore, WO 2009/091864 describes compositions and methods for the suppression of target genes from insect pest species including pests from the Lygus genus. Nucleic acid molecules including RNAi for targeting the vacuolar ATPase H subunit, useful for controlling a coleopteran pest population and infestation as described in US Patent Application Publication 2012/0198586. WO 2012/055982 describes ribonucleic acid (RNA or double stranded RNA) that inhibits or down regulates the expression of a target gene that encodes: an insect ribosomal protein such as the ribosomal protein L19, the ribosomal protein L40 or the ribosomal protein S27A; an insect proteasome subunit such as the Rpn6 protein, the Pros 25, the Rpn2 protein, the proteasome beta 1 subunit protein or the Pros beta 2 protein; an insect β-coatomer of the COPI vesicle, the γ-coatomer of the COPI vesicle, the β′-coatomer protein or the ζ-coatomer of the COPI vesicle; an insect Tetraspanine 2 Å protein which is a putative transmembrane domain protein; an insect protein belonging to the actin family such as Actin 5C; an insect ubiquitin-5E protein; an insect Sec23 protein which is a GTPase activator involved in intracellular protein transport; an insect crinkled protein which is an unconventional myosin which is involved in motor activity; an insect crooked neck protein which is involved in the regulation of nuclear alternative mRNA splicing; an insect vacuolar H+-ATPase G-subunit protein; and an insect Tbp-1 such as Tat-binding protein. US Patent Application Publications 2012/029750, US 20120297501, and 2012/0322660 describe interfering ribonucleic acids (RNA or double stranded RNA) that functions upon uptake by an insect pest species to down-regulate expression of a target gene in said insect pest, wherein the RNA comprises at least one silencing element wherein the silencing element is a region of double-stranded RNA comprising annealed complementary strands, one strand of which comprises or consists of a sequence of nucleotides which is at least partially complementary to a target nucleotide sequence within the target gene. US Patent Application Publication 2012/0164205 describe potential targets for interfering double stranded ribonucleic acids for inhibiting invertebrate pests including: a Chd3 Homologous Sequence, a Beta-Tubulin Homologous Sequence, a 40 kDa V-ATPase Homologous Sequence, a EF1a Homologous Sequence, a 26S Proteosome Subunit p28 Homologous Sequence, a Juvenile Hormone Epoxide Hydrolase Homologous Sequence, a Swelling Dependent Chloride Channel Protein Homologous Sequence, a Glucose-6-Phosphate 1-Dehydrogenase Protein Homologous Sequence, an Act42A Protein Homologous Sequence, a ADP-Ribosylation Factor 1 Homologous Sequence, a Transcription Factor IIB Protein Homologous Sequence, a Chitinase Homologous Sequences, a Ubiquitin Conjugating Enzyme Homologous Sequence, a Glyceraldehyde-3-Phosphate Dehydrogenase Homologous Sequence, an Ubiquitin B Homologous Sequence, a Juvenile Hormone Esterase Homolog, and an Alpha Tubuliln Homologous Sequence.

Use in Pesticidal Control

General methods for employing strains comprising a nucleic acid sequence of the embodiments, or a variant thereof, in pesticide control or in engineering other organisms as pesticidal agents are known in the art. See, for example U.S. Pat. No. 5,039,523 and EP 0480762A2.

Microorganism hosts that are known to occupy the “phytosphere” (phylloplane, phyllosphere, rhizosphere, and/or rhizoplana) of one or more crops of interest may be selected. These microorganisms are selected so as to be capable of successfully competing in the particular environment with the wild-type microorganisms, provide for stable maintenance and expression of the gene expressing the PHI-4 polypeptide, and desirably, provide for improved protection of the pesticide from environmental degradation and inactivation.

Such microorganisms include bacteria, algae, and fungi. Of particular interest are microorganisms such as bacteria, e.g., Pseudomonas, Erwinia, Serratia, Klebsiella, Xanthomonas, Streptomyces, Rhizobium, Rhodopseudomonas, Methylius, Agrobacterium, Acetobacter, Lactobacillus, Arthrobacter, Azotobacter, Leuconostoc, and Alcaligenes, fungi, particularly yeast, e.g., Saccharomyces, Cryptococcus, Kluyveromyces, Sporobolomyces, Rhodotorula, and Aureobasidium. Of particular interest are such phytosphere bacterial species as Pseudomonas syringae, Pseudomonas fluorescens, Pseudomonas chlororaphis, Serratia marcescens, Acetobacter xylinum, Agrobacteria, Rhodopseudomonas spheroides, Xanthomonas campestris, Rhizobium melioti, Alcaligenes entrophus, Clavibacter xyli and Azotobacter vinelandii and phytosphere yeast species such as Rhodotorula rubra, R. glutinis, R. marina, R. aurantiaca, Cryptococcus albidus, C. diffluens, C. laurentii, Saccharomyces rosei, S. pretoriensis, S. cerevisiae, Sporobolomyces roseus, S. odorus, Kluyveromyces veronae, and Aureobasidium pollulans. Of particular interest are the pigmented microorganisms. Host organisms of particular interest include yeast, such as Rhodotorula spp., Aureobasidium spp., Saccharomyces spp. (such as S. cerevisiae), Sporobolomyces spp., phylloplane organisms such as Pseudomonas spp. (such as P. aeruginosa, P. fluorescens, P. chlororaphis), Erwinia spp., and Flavobacterium spp., and other such organisms, including Agrobacterium tumefaciens, E. coli, Bacillus subtilis, Bacillus cereus and the like.

Genes encoding the PHI-4 polypeptides of the embodiments can be introduced into microorganisms that multiply on plants (epiphytes) to deliver PHI-4 polypeptides to potential target pests. Epiphytes, for example, can be gram-positive or gram-negative bacteria.

Root-colonizing bacteria, for example, can be isolated from the plant of interest by methods known in the art. Specifically, a Bacillus cereus strain that colonizes roots can be isolated from roots of a plant (see, for example, Handelsman, et al., (1991) Appl. Environ. Microbiol. 56:713-718). Genes encoding the PHI-4 polypeptides of the embodiments can be introduced into a root-colonizing Bacillus cereus by standard methods known in the art.

Genes encoding PHI-4 polypeptides can be introduced, for example, into the root-colonizing Bacillus by means of electro transformation. Specifically, genes encoding the PHI-4 polypeptides can be cloned into a shuttle vector, for example, pHT3101 (Lerecius, et al., (1989) FEMS Microbiol. Letts. 60:211-218. The shuttle vector pHT3101 containing the coding sequence for the particular PHI-4 polypeptide gene can, for example, be transformed into the root-colonizing Bacillus by means of electroporation (Lerecius, et al., (1989) FEMS Microbiol. Letts. 60:211-218).

Expression systems can be designed so that PHI-4 polypeptides are secreted outside the cytoplasm of gram-negative bacteria, such as E. coli, for example. Advantages of having PHI-4 polypeptides secreted are: (1) avoidance of potential cytotoxic effects of the PHI-4 polypeptide expressed; and (2) improvement in the efficiency of purification of the PHI-4 polypeptide, including, but not limited to, increased efficiency in the recovery and purification of the protein per volume cell broth and decreased time and/or costs of recovery and purification per unit protein.

PHI-4 polypeptides can be made to be secreted in E. coli, for example, by fusing an appropriate E. coli signal peptide to the amino-terminal end of the PHI-4 polypeptide. Signal peptides recognized by E. coli can be found in proteins already known to be secreted in E. coli, for example the OmpA protein (Ghrayeb, et al., (1984) EMBO J, 3:2437-2442). OmpA is a major protein of the E. coli outer membrane, and thus its signal peptide is thought to be efficient in the translocation process. Also, the OmpA signal peptide does not need to be modified before processing as may be the case for other signal peptides, for example lipoprotein signal peptide (Duffaud, et al., (1987) Meth. Enzymol. 153:492).

PHI-4 polypeptides of the embodiments can be fermented in a bacterial host and the resulting bacteria processed and used as a microbial spray in the same manner that Bt strains have been used as insecticidal sprays. In the case of a PHI-4 polypeptide that is secreted from Bacillus, the secretion signal is removed or mutated using procedures known in the art. Such mutations and/or deletions prevent secretion of the PHI-4 polypeptide into the growth medium during the fermentation process. The PHI-4 polypeptides are retained within the cell, and the cells are then processed to yield the encapsulated PHI-4 polypeptides. Any suitable microorganism can be used for this purpose. Pseudomonas has been used to express Bt toxins as encapsulated proteins and the resulting cells processed and sprayed as an insecticide (Gaertner, et al., (1993), in: Advanced Engineered Pesticides, ed. Kim).

Alternatively, the PHI-4 polypeptides are produced by introducing a heterologous gene into a cellular host. Expression of the heterologous gene results, directly or indirectly, in the intracellular production and maintenance of the pesticide. These cells are then treated under conditions that prolong the activity of the toxin produced in the cell when the cell is applied to the environment of target pest(s). The resulting product retains the toxicity of the toxin. These naturally encapsulated PHI-4 polypeptides may then be formulated in accordance with conventional techniques for application to the environment hosting a target pest, e.g., soil, water, and foliage of plants. See, for example EPA 0192319, and the references cited therein.

Pesticidal Compositions

In some embodiments the active ingredients can be applied in the form of compositions and can be applied to the crop area or plant to be treated, simultaneously or in succession, with other compounds. These compounds can be fertilizers, weed killers, cryoprotectants, surfactants, detergents, pesticidal soaps, dormant oils, polymers, and/or time-release or biodegradable carrier formulations that permit long-term dosing of a target area following a single application of the formulation. They can also be selective herbicides, chemical insecticides, virucides, microbicides, amoebicides, pesticides, fungicides, bacteriocides, nematocides, molluscicides or mixtures of several of these preparations, if desired, together with further agriculturally acceptable carriers, surfactants or application-promoting adjuvants customarily employed in the art of formulation. Suitable carriers and adjuvants can be solid or liquid and correspond to the substances ordinarily employed in formulation technology, e.g. natural or regenerated mineral substances, solvents, dispersants, wetting agents, tackifiers, binders or fertilizers. Likewise the formulations may be prepared into edible “baits” or fashioned into pest “traps” to permit feeding or ingestion by a target pest of the pesticidal formulation.

Methods of applying an active ingredient or an agrochemical composition that contains at least one of the PHI-4 polypeptides produced by the bacterial strains include leaf application, seed coating and soil application. The number of applications and the rate of application depend on the intensity of infestation by the corresponding pest.

The composition may be formulated as a powder, dust, pellet, granule, spray, emulsion, colloid, solution, or such like, and may be prepared by such conventional means as desiccation, lyophilization, homogenation, extraction, filtration, centrifugation, sedimentation, or concentration of a culture of cells comprising the polypeptide. In all such compositions that contain at least one such pesticidal polypeptide, the polypeptide may be present in a concentration of from about 1% to about 99% by weight.

Lepidopteran, dipteran, heteropteran, nematode, hemiptera, or coleopteran pests may be killed or reduced in numbers in a given area by the methods of the disclosure, or may be prophylactically applied to an environmental area to prevent infestation by a susceptible pest. Preferably the pest ingests, or is contacted with, a pesticidally-effective amount of the polypeptide. By “pesticidally-effective amount” is intended an amount of the pesticide that is able to bring about death to at least one pest, or to noticeably reduce pest growth, feeding, or normal physiological development. This amount will vary depending on such factors as, for example, the specific target pests to be controlled, the specific environment, location, plant, crop or agricultural site to be treated, the environmental conditions, and the method, rate, concentration, stability, and quantity of application of the pesticidally-effective polypeptide composition. The formulations may also vary with respect to climatic conditions, environmental considerations, and/or frequency of application and/or severity of pest infestation.

The pesticide compositions described may be made by formulating either the bacterial cell, crystal and/or spore suspension, or isolated protein component with the desired agriculturally-acceptable carrier. The compositions may be formulated prior to administration in an appropriate means such as lyophilized, freeze-dried, desiccated, or in an aqueous carrier, medium or suitable diluent, such as saline or other buffer. The formulated compositions may be in the form of a dust or granular material, or a suspension in oil (vegetable or mineral), or water or oil/water emulsions, or as a wettable powder, or in combination with any other carrier material suitable for agricultural application. Suitable agricultural carriers can be solid or liquid and are well known in the art. The term “agriculturally-acceptable carrier” covers all adjuvants, inert components, dispersants, surfactants, tackifiers, binders, etc. that are ordinarily used in pesticide formulation technology; these are well known to those skilled in pesticide formulation. The formulations may be mixed with one or more solid or liquid adjuvants and prepared by various means, e.g., by homogeneously mixing, blending and/or grinding the pesticidal composition with suitable adjuvants using conventional formulation techniques. Suitable formulations and application methods are described in U.S. Pat. No. 6,468,523, herein incorporated by reference. The plants can also be treated with one or more chemical compositions, including one or more herbicide, insecticides, or fungicides. Exemplary chemical compositions include: Fruits/Vegetables Herbicides: Atrazine, Bromacil, Diuron, Glyphosate, Linuron, Metribuzin, Simazine, Trifluralin, Fluazifop, Glufosinate, Halo sulfuron Gowan, Paraquat, Propyzamide, Sethoxydim, Butafenacil, Halosulfuron, Indaziflam; Fruits/Vegetables Insecticides: Aldicarb, Bacillus thuriengiensis, Carbaryl, Carbofuran, Chlorpyrifos, Cypermethrin, Deltamethrin, Diazinon, Malathion, Abamectin, Cyfluthrin/beta-cyfluthrin, Esfenvalerate, Lambda-cyhalothrin, Acequinocyl, Bifenazate, Methoxyfenozide, Novaluron, Chromafenozide, Thiacloprid, Dinotefuran, Fluacrypyrim, Tolfenpyrad, Clothianidin, Spirodiclofen, Gamma-cyhalothrin, Spiromesifen, Spinosad, Rynaxypyr, Cyazypyr, Spinoteram, Triflumuron, Spirotetramat, Imidacloprid, Flubendiamide, Thiodicarb, Metaflumizone, Sulfoxaflor, Cyflumetofen, Cyanopyrafen, Imidacloprid, Clothianidin, Thiamethoxam, Spinotoram, Thiodicarb, Flonicamid, Methiocarb, Emamectin-benzoate, Indoxacarb, Forthiazate, Fenamiphos, Cadusaphos, Pyriproxifen, Fenbutatin-oxid, Hexthiazox, Methomyl, 4-[[(6-Chlorpyridin-3-yl)methyl](2,2-difluorethyl)amino]furan-2(5H)-on; Fruits/Vegetables Fungicides: Carbendazim, Chlorothalonil, EBDCs, Sulphur, Thiophanate-methyl, Azoxystrobin, Cymoxanil, Fluazinam, Fosetyl, Iprodione, Kresoxim-methyl, Metalaxyl/mefenoxam, Trifloxystrobin, Ethaboxam, Iprovalicarb, Trifloxystrobin, Fenhexamid, Oxpoconazole fumarate, Cyazofamid, Fenamidone, Zoxamide, Picoxystrobin, Pyraclostrobin, Cyflufenamid, Boscalid; Cereals Herbicides: Isoproturon, Bromoxynil, Ioxynil, Phenoxies, Chlorsulfuron, Clodinafop, Diclofop, Diflufenican, Fenoxaprop, Florasulam, Fluoroxypyr, Metsulfuron, Triasulfuron, Flucarbazone, Iodosulfuron, Propoxycarbazone, Picolinafen, Mesosulfuron, Beflubutamid, Pinoxaden, Amidosulfuron, Thifensulfuron Methyl, Tribenuron, Flupyrsulfuron, Sulfosulfuron, Pyrasulfotole, Pyroxsulam, Flufenacet, Tralkoxydim, Pyroxasulfon; Cereals Fungicides: Carbendazim, Chlorothalonil, Azoxystrobin, Cyproconazole, Cyprodinil, Fenpropimorph, Epoxiconazole, Kresoxim-methyl, Quinoxyfen, Tebuconazole, Trifloxystrobin, Simeconazole, Picoxystrobin, Pyraclostrobin, Dimoxystrobin, Prothioconazole, Fluoxastrobin; Cereals Insecticides: Dimethoate, Lambda-cyhalthrin, Deltamethrin, alpha-Cypermethrin, β-cyfluthrin, Bifenthrin, Imidacloprid, Clothianidin, Thiamethoxam, Thiacloprid, Acetamiprid, Dinetofuran, Clorphyriphos, Metamidophos, Oxidemethon-methyl, Pirimicarb, Methiocarb; Maize Herbicides: Atrazine, Alachlor, Bromoxynil, Acetochlor, Dicamba, Clopyralid, (S-) Dimethenamid, Glufosinate, Glyphosate, Isoxaflutole, (S-)Metolachlor, Mesotrione, Nicosulfuron, Primisulfuron, Rimsulfuron, Sulcotrione, Foramsulfuron, Topramezone, Tembotrione, Saflufenacil, Thiencarbazone, Flufenacet, Pyroxasulfon; Maize Insecticides: Carbofuran, Chlorpyrifos, Bifenthrin, Fipronil, Imidacloprid, Lambda-Cyhalothrin, Tefluthrin, Terbufos, Thiamethoxam, Clothianidin, Spiromesifen, Flubendiamide, Triflumuron, Rynaxypyr, Deltamethrin, Thiodicarb, β-Cyfluthrin, Cypermethrin, Bifenthrin, Lufenuron, Triflumoron, Tefluthrin, Tebupirimphos, Ethiprole, Cyazypyr, Thiacloprid, Acetamiprid, Dinetofuran, Avermectin, Methiocarb, Spirodiclofen, Spirotetramat; Maize Fungicides: Fenitropan, Thiram, Prothioconazole, Tebuconazole, Trifloxystrobin; Rice Herbicides: Butachlor, Propanil, Azimsulfuron, Bensulfuron, Cyhalofop, Daimuron, Fentrazamide, Imazosulfuron, Mefenacet, Oxaziclomefone, Pyrazosulfuron, Pyributicarb, Quinclorac, Thiobencarb, Indanofan, Flufenacet, Fentrazamide, Halosulfuron, Oxaziclomefone, Benzobicyclon, Pyriftalid, Penoxsulam, Bispyribac, Oxadiargyl, Ethoxysulfuron, Pretilachlor, Mesotrione, Tefuryltrione, Oxadiazone, Fenoxaprop, Pyrimisulfan; Rice Insecticides: Diazinon, Fenitrothion, Fenobucarb, Monocrotophos, Benfuracarb, Buprofezin, Dinotefuran, Fipronil, Imidacloprid, Isoprocarb, Thiacloprid, Chromafenozide, Thiacloprid, Dinotefuran, Clothianidin, Ethiprole, Flubendiamide, Rynaxypyr, Deltamethrin, Acetamiprid, Thiamethoxam, Cyazypyr, Spinosad, Spinotoram, Emamectin-Benzoate, Cypermethrin, Chlorpyriphos, Cartap, Methamidophos, Etofenprox, Triazophos, 4-[[(6-Chlorpyridin-3-yl)methyl](2,2-difluorethyl)amino]furan-2(5H)-on, Carbofuran, Benfuracarb; Rice Fungicides: Thiophanate-methyl, Azoxystrobin, Carpropamid, Edifenphos, Ferimzone, Iprobenfos, Isoprothiolane, Pencycuron, Probenazole, Pyroquilon, Tricyclazole, Trifloxystrobin, Diclocymet, Fenoxanil, Simeconazole, Tiadinil; Cotton Herbicides: Diuron, Fluometuron, MSMA, Oxyfluorfen, Prometryn, Trifluralin, Carfentrazone, Clethodim, Fluazifop-butyl, Glyphosate, Norflurazon, Pendimethalin, Pyrithiobac-sodium, Trifloxysulfuron, Tepraloxydim, Glufosinate, Flumioxazin, Thidiazuron; Cotton Insecticides: Acephate, Aldicarb, Chlorpyrifos, Cypermethrin, Deltamethrin, Malathion, Monocrotophos, Abamectin, Acetamiprid, Emamectin Benzoate, Imidacloprid, Indoxacarb, Lambda-Cyhalothrin, Spinosad, Thiodicarb, Gamma-Cyhalothrin, Spiromesifen, Pyridalyl, Flonicamid, Flubendiamide, Triflumuron, Rynaxypyr, Beta-Cyfluthrin, Spirotetramat, Clothianidin, Thiamethoxam, Thiacloprid, Dinetofuran, Flubendiamide, Cyazypyr, Spinosad, Spinotoram, gamma Cyhalothrin, 4-[[(6-Chlorpyridin-3-yl)methyl](2,2-difluorethyl)amino]furan-2(5H)-on, Thiodicarb, Avermectin, Flonicamid, Pyridalyl, Spiromesifen, Sulfoxaflor, Profenophos, Thriazophos, Endosulfan; Cotton Fungicides: Etridiazole, Metalaxyl, Quintozene; Soybean Herbicides: Alachlor, Bentazone, Trifluralin, Chlorimuron-Ethyl, Cloransulam-Methyl, Fenoxaprop, Fomesafen, Fluazifop, Glyphosate, Imazamox, Imazaquin, Imazethapyr, (S-)Metolachlor, Metribuzin, Pendimethalin, Tepraloxydim, Glufosinate; Soybean Insecticides: Lambda-cyhalothrin, Methomyl, Parathion, Thiocarb, Imidacloprid, Clothianidin, Thiamethoxam, Thiacloprid, Acetamiprid, Dinetofuran, Flubendiamide, Rynaxypyr, Cyazypyr, Spinosad, Spinotoram, Emamectin-Benzoate, Fipronil, Ethiprole, Deltamethrin, δ-Cyfluthrin, gamma and lambda Cyhalothrin, 4-[[(6-Chlorpyridin-3-yl)methyl](2,2-difluorethyl)amino]furan-2(5H)-on, Spirotetramat, Spinodiclofen, Triflumuron, Flonicamid, Thiodicarb, beta-Cyfluthrin; Soybean Fungicides: Azoxystrobin, Cyproconazole, Epoxiconazole, Flutriafol, Pyraclostrobin, Tebuconazole, Trifloxystrobin, Prothioconazole, Tetraconazole; Sugarbeet Herbicides: Chloridazon, Desmedipham, Ethofumesate, Phenmedipham, Triallate, Clopyralid, Fluazifop, Lenacil, Metamitron, Quinmerac, Cycloxydim, Triflusulfuron, Tepraloxydim, Quizalofop; Sugarbeet Insecticides: Imidacloprid, Clothianidin, Thiamethoxam, Thiacloprid, Acetamiprid, Dinetofuran, Deltamethrin, δ-Cyfluthrin, gamma/lambda Cyhalothrin, 4-[[(6-Chlorpyridin-3-yl)methyl](2,2-difluorethyl)amino]furan-2(5H)-on, Tefluthrin, Rynaxypyr, Cyaxypyr, Fipronil, Carbofuran; Canola Herbicides: Clopyralid, Diclofop, Fluazifop, Glufosinate, Glyphosate, Metazachlor, Trifluralin Ethametsulfuron, Quinmerac, Quizalofop, Clethodim, Tepraloxydim; Canola Fungicides: Azoxystrobin, Carbendazim, Fludioxonil, Iprodione, Prochloraz, Vinclozolin; Canola Insecticides: Carbofuran, Organophosphates, Pyrethroids, Thiacloprid, Deltamethrin, Imidacloprid, Clothianidin, Thiamethoxam, Acetamiprid, Dinetofuran, β-Cyfluthrin, gamma and lambda Cyhalothrin, tau-Fluvaleriate, Ethiprole, Spinosad, Spinotoram, Flubendiamide, Rynaxypyr, Cyazypyr, 4-[[(6-Chlorpyridin-3-yl)methyl](2,2-difluorethyl)amino]furan-2(5H)-on.

In some embodiments the herbicide is Atrazine, Bromacil, Diuron, Chlorsulfuron, Metsulfuron, Thifensulfuron Methyl, Tribenuron, Acetochlor, Dicamba, Isoxaflutole, Nicosulfuron, Rimsulfuron, Pyrithiobac-sodium, Flumioxazin, Chlorimuron-Ethyl, Metribuzin, Quizalofop, S-metolachlor, Hexazinne or combinations thereof.

In some embodiments the insecticide is Esfenvalerate, Chlorantraniliprole, Methomyl, Indoxacarb, Oxamyl, or combinations thereof.

Pesticidal and Insecticidal Activity

“Pest” includes but is not limited to, insects, fungi, bacteria, nematodes, mites, ticks, and the like. Insect pests include insects selected from the orders Coleoptera, Diptera, Hymenoptera, Lepidoptera, Mallophaga, Homoptera, Hemiptera, Orthroptera, Thysanoptera, Dermaptera, Isoptera, Anoplura, Siphonaptera, Trichoptera, etc., particularly Lepidoptera, and Hemiptera.

Those skilled in the art will recognize that not all compounds are equally effective against all pests. Compounds of the embodiments display activity against insect pests, which may include economically important agronomic, forest, greenhouse, nursery, ornamentals, food and fiber, public and animal health, domestic and commercial structure, household and stored product pests.

Larvae of the order Lepidoptera include, but are not limited to, armyworms, cutworms, loopers, and heliothines in the family Noctuidae Spodoptera frugiperda JE Smith (fall armyworm); S. exigua Hübner (beet armyworm); S. litura Fabricius (tobacco cutworm, cluster caterpillar); Mamestra configurata Walker (bertha armyworm); M. brassicae Linnaeus (cabbage moth); Agrotis ipsilon Hufnagel (black cutworm); A. orthogonia Morrison (western cutworm); A. subterranea Fabricius (granulate cutworm); Alabama argillacea Hübner (cotton leaf worm); Trichoplusia ni Hübner (cabbage looper); Pseudoplusia includens Walker (soybean looper); Anticarsia gemmatalis Hübner (velvetbean caterpillar); Hypena scabra Fabricius (green cloverworm); Heliothis virescens Fabricius (tobacco budworm); Pseudaletia unipuncta Haworth (armyworm); Athetis mindara Barnes and Mcdunnough (rough skinned cutworm); Euxoa messoria Harris (darksided cutworm); Earias insulana Boisduval (spiny bollworm); E. vittella Fabricius (spotted bollworm); Helicoverpa armigera Hübner (American bollworm); H. zea Boddie (corn earworm or cotton bollworm); Melanchra picta Harris (zebra caterpillar); Egira (Xylomyges) curialis Grote (citrus cutworm); borers, casebearers, webworms, coneworms, and skeletonizers from the family Pyralidae Ostrinia nubilalis Hübner (European corn borer); Amyelois transitella Walker (naval orangeworm); Anagasta kuehniella Zeller (Mediterranean flour moth); Cadra cautella Walker (almond moth); Chilo suppressalis Walker (rice stem borer); C. partellus, (sorghum borer); Corcyra cephalonica Stainton (rice moth); Crambus caliginosellus Clemens (corn root webworm); C. teterrellus Zincken (bluegrass webworm); Cnaphalocrocis medinalis Guenée (rice leaf roller); Desmia funeralis Hübner (grape leaffolder); Diaphania hyalinata Linnaeus (melon worm); D. nitidalis Stoll (pickleworm); Diatraea grandiosella Dyar (southwestern corn borer), D. saccharalis Fabricius (surgarcane borer); Eoreuma loftini Dyar (Mexican rice borer); Ephestia elutella Hübner (tobacco (cacao) moth); Galleria mellonella Linnaeus (greater wax moth); Herpetogramma licarsisalis Walker (sod webworm); Homoeosoma electellum Hulst (sunflower moth); Elasmopalpus lignosellus Zeller (lesser cornstalk borer); Achroia grisella Fabricius (lesser wax moth); Loxostege sticticalis Linnaeus (beet webworm); Orthaga thyrisalis Walker (tea tree web moth); Maruca testulalis Geyer (bean pod borer); Plodia interpunctella Hübner (Indian meal moth); Scirpophaga incertulas Walker (yellow stem borer); Udea rubigalis Guenée (celery leaftier); and leafrollers, budworms, seed worms, and fruit worms in the family Tortricidae Acleris gloverana Walsingham (Western blackheaded budworm); A. variana Fernald (Eastern blackheaded budworm); Archips argyrospila Walker (fruit tree leaf roller); A. rosana Linnaeus (European leaf roller); and other Archips species, Adoxophyes orana Fischer von Rösslerstamm (summer fruit tortrix moth); Cochylis hospes Walsingham (banded sunflower moth); Cydia latiferreana Walsingham (filbertworm); C. pomonella Linnaeus (coding moth); Platynota flavedana Clemens (variegated leafroller); P. stultana Walsingham (omnivorous leafroller); Lobesia botrana Denis & Schiffermüller (European grape vine moth); Spilonota ocellana Denis & Schiffermüller (eyespotted bud moth); Endopiza viteana Clemens (grape berry moth); Eupoecilia ambiguella Hübner (vine moth); Bonagota salubricola Meyrick (Brazilian apple leafroller); Grapholita molesta Busck (oriental fruit moth); Suleima helianthana Riley (sunflower bud moth); Argyrotaenia spp.; Choristoneura spp.

Selected other agronomic pests in the order Lepidoptera include, but are not limited to, Alsophila pometaria Harris (fall cankerworm); Anarsia lineatella Zeller (peach twig borer); Anisota senatoria J. E. Smith (orange striped oakworm); Antheraea pernyi Guérin-Méneville (Chinese Oak Tussah Moth); Bombyx mori Linnaeus (Silkworm); Bucculatrix thurberiella Busck (cotton leaf perforator); Collas eurytheme Boisduval (alfalfa caterpillar); Datana integerrima Grote & Robinson (walnut caterpillar); Dendrolimus sibiricus Tschetwerikov (Siberian silk moth), Ennomos subsignaria Hübner (elm spanworm); Erannis tiliaria Harris (linden looper); Euproctis chrysorrhoea Linnaeus (browntail moth); Harrisina americana Guérin-Méneville (grapeleaf skeletonizer); Hemileuca oliviae Cockrell (range caterpillar); Hyphantria cunea Drury (fall webworm); Keiferia lycopersicella Walsingham (tomato pinworm); Lambdina fiscellaria fiscellaria Hulst (Eastern hemlock looper); L. fiscellaria lugubrosa Hulst (Western hemlock looper); Leucoma salicis Linnaeus (satin moth); Lymantria dispar Linnaeus (gypsy moth); Manduca quinquemaculata Haworth (five spotted hawk moth, tomato hornworm); M. sexta Haworth (tomato hornworm, tobacco hornworm); Operophtera brumata Linnaeus (winter moth); Paleacrita vemata Peck (spring cankerworm); Papilio cresphontes Cramer (giant swallowtail, orange dog); Phryganidia californica Packard (California oakworm); Phyllocnistis citrella Stainton (citrus leafminer); Phyllonorycter blancardella Fabricius (spotted tentiform leafminer); Pieris brassicae Linnaeus (large white butterfly); P. rapae Linnaeus (small white butterfly); P. napi Linnaeus (green veined white butterfly); Platyptilia carduidactyla Riley (artichoke plume moth); Plutella xylostella Linnaeus (diamondback moth); Pectinophora gossypiella Saunders (pink bollworm); Pontia protodice Boisduval & Leconte (Southern cabbageworm); Sabulodes aegrotata Guenée (omnivorous looper); Schizura concinna J. E. Smith (red humped caterpillar); Sitotroga cerealella Olivier (Angoumois grain moth); Thaumetopoea pityocampa Schiffermuller (pine processionary caterpillar); Tineola bisselliella Hummel (webbing clothesmoth); Tuta absoluta Meyrick (tomato leafminer); Yponomeuta padella Linnaeus (ermine moth); Heliothis subflexa Guenée; Malacosoma spp. and Orgyia spp.

Of interest are larvae and adults of the order Coleoptera including weevils from the families Anthribidae, Bruchidae, and Curculionidae (including, but not limited to: Anthonomus grandis Boheman (boll weevil); Lissorhoptrus oryzophilus Kuschel (rice water weevil); Sitophilus granarius Linnaeus (granary weevil); S. oryzae Linnaeus (rice weevil); Hypera punctata Fabricius (clover leaf weevil); Cylindrocopturus adspersus LeConte (sunflower stem weevil); Smicronyx fulvus LeConte (red sunflower seed weevil); S. sordidus LeConte (gray sunflower seed weevil); Sphenophorus maidis Chittenden (maize billbug)); flea beetles, cucumber beetles, rootworms, leaf beetles, potato beetles, and leafminers in the family Chrysomelidae (including, but not limited to: Leptinotarsa decemlineata Say (Colorado potato beetle); Diabrotica virgifera virgifera LeConte (western corn rootworm); D. barberi Smith & Lawrence (northern corn rootworm); D. undecimpunctata howardi Barber (southern corn rootworm); Chaetocnema pulicaria Melsheimer (corn flea beetle); Phyllotreta cruciferae Goeze (corn flea beetle); Colaspis brunnea Fabricius (grape colaspis); Oulema melanopus Linnaeus (cereal leaf beetle); Zygogramma exclamationis Fabricius (sunflower beetle)); beetles from the family Coccinellidae (including, but not limited to: Epilachna varivestis Mulsant (Mexican bean beetle)); chafers and other beetles from the family Scarabaeidae (including, but not limited to: Popillia japonica Newman (Japanese beetle); Cyclocephala borealis Arrow (northern masked chafer, white grub); C. immaculata Olivier (southern masked chafer, white grub); Rhizotrogus majalis Razoumowsky (European chafer); Phyllophaga crinita Burmeister (white grub); Ligyrus gibbosus De Geer (carrot beetle)); carpet beetles from the family Dermestidae; wireworms from the family Elateridae, Eleodes spp., Melanotus spp.; Conoderus spp.; Limonius spp.; Agriotes spp.; Ctenicera spp.; Aeolus spp.; bark beetles from the family Scolytidae and beetles from the family Tenebrionidae.

Adults and immatures of the order Diptera are of interest, including leafminers Agromyza parvicornis Loew (corn blotch leafminer); midges (including, but not limited to: Contarinia sorghicola Coquillett (sorghum midge); Mayetiola destructor Say (Hessian fly); Sitodiplosis mosellana Géhin (wheat midge); Neolasioptera murtfeldtiana Felt, (sunflower seed midge)); fruit flies (Tephritidae), Oscinella frit Linnaeus (fruit flies); maggots (including, but not limited to: Delia platura Meigen (seedcorn maggot); D. coarctata Fallen (wheat bulb fly); and other Delia spp., Meromyza americana Fitch (wheat stem maggot); Musca domestica Linnaeus (house flies); Fannia canicularis Linnaeus, F. femoralis Stein (lesser house flies); Stomoxys calcitrans Linnaeus (stable flies)); face flies, horn flies, blow flies, Chrysomya spp.; Phormia spp.; and other muscoid fly pests, horse flies Tabanus spp.; bot flies Gastrophilus spp.; Oestrus spp.; cattle grubs Hypoderma spp.; deer flies Chrysops spp.; Melophagus ovinus Linnaeus (keds); and other Brachycera, mosquitoes Aedes spp.; Anopheles spp.; Culex spp.; black flies Prosimulium spp.; Simulium spp.; biting midges, sand flies, sciarids, and other Nematocera.

Included as insects of interest are adults and nymphs of the orders Hemiptera and Homoptera such as, but not limited to, adelgids from the family Adelgidae, plant bugs from the family Miridae, cicadas from the family Cicadidae, leafhoppers, Empoasca spp.; from the family Cicadellidae, planthoppers from the families Cixiidae, Flatidae, Fulgoroidea, lssidae and Delphacidae, treehoppers from the family Membracidae, psyllids from the family Psyllidae, whiteflies from the family Aleyrodidae, aphids from the family Aphididae, Phylloxera from the family Phylloxeridae, mealybugs from the family Pseudococcidae, scales from the families Asterolecanidae, Coccidae, Dactylopiidae, Diaspididae, Eriococcidae, Ortheziidae, Phoenicococcidae and Margarodidae, lace bugs from the family Tingidae, stink bugs from the family Pentatomidae, cinch bugs, Blissus spp.; and other seed bugs from the family Lygaeidae, spittlebugs from the family Cercopidae squash bugs from the family Coreidae, and red bugs and cotton stainers from the family Pyrrhocoridae.

Agronomically important members from the order Homoptera further include, but are not limited to: Acyrthisiphon pisum Harris (pea aphid); Aphis craccivora Koch (cowpea aphid); A. fabae Scopoli (black bean aphid); A. gossypii Glover (cotton aphid, melon aphid); A. maidiradicis Forbes (corn root aphid); A. pomi De Geer (apple aphid); A. spiraecola Patch (spirea aphid); Aulacorthum solani Kaltenbach (foxglove aphid); Chaetosiphon fragaefolii Cockerell (strawberry aphid); Diuraphis noxia Kurdjumov/Mordvilko (Russian wheat aphid); Dysaphis plantaginea Paaserini (rosy apple aphid); Eriosoma lanigerum Hausmann (woolly apple aphid); Brevicoryne brassicae Linnaeus (cabbage aphid); Hyalopterus pruni Geoffroy (mealy plum aphid); Lipaphis erysimi Kaltenbach (turnip aphid); Metopolophium dirrhodum Walker (cereal aphid); Macrosiphum euphorbiae Thomas (potato aphid); Myzus persicae Sulzer (peach-potato aphid, green peach aphid); Nasonovia ribisnigri Mosley (lettuce aphid); Pemphigus spp. (root aphids and gall aphids); Rhopalosiphum maidis Fitch (corn leaf aphid); R. padi Linnaeus (bird cherry-oat aphid); Schizaphis graminum Rondani (greenbug); Sipha flava Forbes (yellow sugarcane aphid); Sitobion avenae Fabricius (English grain aphid); Therioaphis maculata Buckton (spotted alfalfa aphid); Toxoptera aurantii Boyer de Fonscolombe (black citrus aphid); and T. citricida Kirkaldy (brown citrus aphid); Adelges spp. (adelgids); Phylloxera devastatrix Pergande (pecan phylloxera); Bemisia tabaci Gennadius (tobacco whitefly, sweetpotato whitefly); B. argentifolii Bellows & Perring (silverleaf whitefly); Dialeurodes citri Ashmead (citrus whitefly); Trialeurodes abutiloneus (bandedwinged whitefly) and T. vaporariorum Westwood (greenhouse whitefly); Empoasca fabae Harris (potato leafhopper); Laodelphax striatellus Fallen (smaller brown planthopper); Macrolestes quadrilineatus Forbes (aster leafhopper); Nephotettix cinticeps Uhler (green leafhopper); N. nigropictus St{dot over (a)}l (rice leafhopper); Nilaparvata lugens St{dot over (a)}l (brown planthopper); Peregrinus maidis Ashmead (corn planthopper); Sogatella furcifera Horvath (white-backed planthopper); Sogatodes orizicola Muir (rice delphacid); Typhlocyba pomaria McAtee (white apple leafhopper); Erythroneoura spp. (grape leafhoppers); Magicicada septendecim Linnaeus (periodical cicada); Icerya purchasi Maskell (cottony cushion scale); Quadraspidiotus perniciosus Comstock (San Jose scale); Planococcus citri Risso (citrus mealybug); Pseudococcus spp. (other mealybug complex); Cacopsylla pyricola Foerster (pear psylla); Trioza diospyri Ashmead (persimmon psylla).

Agronomically important species of interest from the order Hemiptera include, but are not limited to: Acrosternum hilare Say (green stink bug); Anasa tristis De Geer (squash bug); Blissus leucopterus leucopterus Say (chinch bug); Corythuca gossypii Fabricius (cotton lace bug); Cyrtopeltis modesta Distant (tomato bug); Dysdercus suturellus Herrich-Schäffer (cotton stainer); Euschistus servus Say (brown stink bug); E. variolarius Palisot de Beauvois (one-spotted stink bug); Graptostethus spp. (complex of seed bugs); Leptoglossus corculus Say (leaf-footed pine seed bug); Lygus lineolaris Palisot de Beauvois (tarnished plant bug); L. Hesperus Knight (Western tarnished plant bug); L. pratensis Linnaeus (common meadow bug); L. rugulipennis Poppius (European tarnished plant bug); Lygocoris pabulinus Linnaeus (common green capsid); Nezara viridula Linnaeus (southern green stink bug); Oebalus pugnax Fabricius (rice stink bug); Oncopeltus fasciatus Dallas (large milkweed bug); Pseudatomoscelis seriatus Reuter (cotton fleahopper).

Furthermore, embodiments may be effective against Hemiptera such, Calocoris norvegicus Gmelin (strawberry bug); Orthops campestris Linnaeus; Plesiocoris rugicollis Fallen (apple capsid); Cyrtopeltis modestus Distant (tomato bug); Cyrtopeltis notatus Distant (suckfly); Spanagonicus albofasciatus Reuter (whitemarked fleahopper); Diaphnocoris chlorionis Say (honeylocust plant bug); Labopidicola allii Knight (onion plant bug); Pseudatomoscelis seriatus Reuter (cotton fleahopper); Adelphocoris rapidus Say (rapid plant bug); Poecilocapsus lineatus Fabricius (four-lined plant bug); Nysius ericae Schilling (false chinch bug); Nysius raphanus Howard (false chinch bug); Nezara viridula Linnaeus (Southern green stink bug); Eurygaster spp.; Coreidae spp.; Pyrrhocoridae spp.; Tinidae spp.; Blostomatidae spp.; Reduviidae spp.; and Cimicidae spp.

Also included are adults and larvae of the order Acari (mites) such as Aceria tosichella Keifer (wheat curl mite); Petrobia latens Müller (brown wheat mite); spider mites and red mites in the family Tetranychidae, Panonychus ulmi Koch (European red mite); Tetranychus urticae Koch (two spotted spider mite); (T. mcdanieli McGregor (McDaniel mite); T. cinnabarinus Boisduval (carmine spider mite); T. turkestani Ugarov & Nikolski (strawberry spider mite); flat mites in the family Tenuipalpidae, Brevipalpus lewisi McGregor (citrus flat mite); rust and bud mites in the family Eriophyidae and other foliar feeding mites and mites important in human and animal health, i.e. dust mites in the family Epidermoptidae, follicle mites in the family Demodicidae, grain mites in the family Glycyphagidae, ticks in the order Ixodidae. Ixodes scapularis Say (deer tick); I. holocyclus Neumann (Australian paralysis tick); Dermacentor variabilis Say (American dog tick); Amblyomma americanum Linnaeus (lone star tick); and scab and itch mites in the families Psoroptidae, Pyemotidae, and Sarcoptidae.

Insect pests of the order Thysanura are of interest, such as Lepisma saccharina Linnaeus (silverfish); Thermobia domestica Packard (firebrat).

Additional arthropod pests covered include: spiders in the order Araneae such as Loxosceles reclusa Gertsch & Mulaik (brown recluse spider); and the Latrodectus mactans Fabricius (black widow spider); and centipedes in the order Scutigeromorpha such as Scutigera coleoptrata Linnaeus (house centipede).

Insect pest of interest include the superfamily of stink bugs and other related insects including but not limited to species belonging to the family Pentatomidae (Nezara viridula, Halyomorpha halys, Piezodorus guildini, Euschistus servus, Acrosternum hilare, Euschistus heros, Euschistus tristigmus, Acrosternum hilare, Dichelops furcatus, Dichelops melacanthus, and Bagrada hilaris (Bagrada Bug)), the family Plataspidae (Megacopta cribraria—Bean plataspid), and the family Cydnidae (Scaptocoris castanea—Root stink bug); and Lepidoptera species including but not limited to: diamond-back moth, e.g., Helicoverpa zea Boddie; soybean looper, e.g., Pseudoplusia includens Walker; and velvet bean caterpillar e.g., Anticarsia gemmatalis Hübner.

Methods for measuring pesticidal activity are well known in the art. See, for example, Czapla and Lang, (1990) J. Econ. Entomol. 83:2480-2485; Andrews, et al., (1988) Biochem. J. 252:199-206; Marrone, et al., (1985) J. of Economic Entomology 78:290-293 and U.S. Pat. No. 5,743,477, all of which are herein incorporated by reference in their entirety. Generally, the protein is mixed and used in feeding assays. See, for example Marrone, et al., (1985) J. of Economic Entomology 78:290-293. Such assays can include contacting plants with one or more pests and determining the plant's ability to survive and/or cause the death of the pests.

Nematodes include parasitic nematodes such as root-knot, cyst, and lesion nematodes, including Heterodera spp., Meloidogyne spp., and Globodera spp.; particularly members of the cyst nematodes, including, but not limited to, Heterodera glycines (soybean cyst nematode); Heterodera schachtii (beet cyst nematode); Heterodera avenae (cereal cyst nematode); and Globodera rostochiensis and Globodera pailida (potato cyst nematodes). Lesion nematodes include Pratylenchus spp.

Seed Treatment

To protect and to enhance yield production and trait technologies, seed treatment options can provide additional crop plan flexibility and cost effective control against insects, weeds and diseases. Seed material can be treated, typically surface treated, with a composition comprising combinations of chemical or biological herbicides, herbicide safeners, insecticides, fungicides, germination inhibitors and enhancers, nutrients, plant growth regulators and activators, bactericides, nematocides, avicides and/or molluscicides. These compounds are typically formulated together with further carriers, surfactants or application-promoting adjuvants customarily employed in the art of formulation. The coatings may be applied by impregnating propagation material with a liquid formulation or by coating with a combined wet or dry formulation. Examples of the various types of compounds that may be used as seed treatments are provided in The Pesticide Manual: A World Compendium, C.D.S. Tomlin Ed., Published by the British Crop Production Council, which is hereby incorporated by reference.

Some seed treatments that may be used on crop seed include, but are not limited to, one or more of abscisic acid, acibenzolar-S-methyl, avermectin, amitrol, azaconazole, azospirillum, azadirachtin, azoxystrobin, Bacillus spp. (including one or more of cereus, firmus, megaterium, pumilis, sphaericus, subtilis and/or thuringiensis species), bradyrhizobium spp. (including one or more of betae, canariense, elkanii, iriomotense, japonicum, liaonigense, pachyrhizi and/or yuanmingense), captan, carboxin, chitosan, clothianidin, copper, cyazypyr, difenoconazole, etidiazole, fipronil, fludioxonil, fluoxastrobin, fluquinconazole, flurazole, fluxofenim, harpin protein, imazalil, imidacloprid, ipconazole, isoflavenoids, lipo-chitooligosaccharide, mancozeb, manganese, maneb, mefenoxam, metalaxyl, metconazole, myclobutanil, PCNB, penflufen, penicillium, penthiopyrad, permethrine, picoxystrobin, prothioconazole, pyraclostrobin, rynaxypyr, S-metolachlor, saponin, sedaxane, TCMTB, tebuconazole, thiabendazole, thiamethoxam, thiocarb, thiram, tolclofos-methyl, triadimenol, trichoderma, trifloxystrobin, triticonazole and/or zinc. PCNB seed coat refers to EPA registration number 00293500419, containing quintozen and terrazole. TCMTB refers to 2-(thiocyanomethylthio) benzothiazole.

Seed varieties and seeds with specific transgenic traits may be tested to determine which seed treatment options and application rates may complement such varieties and transgenic traits in order to enhance yield. For example, a variety with good yield potential but head smut susceptibility may benefit from the use of a seed treatment that provides protection against head smut, a variety with good yield potential but cyst nematode susceptibility may benefit from the use of a seed treatment that provides protection against cyst nematode, and so on. Likewise, a variety encompassing a transgenic trait conferring insect resistance may benefit from the second mode of action conferred by the seed treatment, a variety encompassing a transgenic trait conferring herbicide resistance may benefit from a seed treatment with a safener that enhances the plants resistance to that herbicide, etc. Further, the good root establishment and early emergence that results from the proper use of a seed treatment may result in more efficient nitrogen use, a better ability to withstand drought and an overall increase in yield potential of a variety or varieties containing a certain trait when combined with a seed treatment.

Methods for Inhibiting Growth or Killing an Insect Pest and Controlling an Insect Population

In some embodiments methods are provided for inhibiting growth or killing an insect pest, comprising contacting the insect pest with an insecticidally-effective amount of a recombinant PHI-4 polypeptide. In some embodiments methods are provided for inhibiting growth or killing an insect pest, comprising contacting the insect pest with an insecticidally-effective amount of a recombinant pesticidal protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID NOs: 51-819 or a variant thereof.

In some embodiments methods are provided for controlling an insect pest population, comprising contacting the insect pest population with an insecticidally-effective amount of a recombinant PHI-4 polypeptide. In some embodiments methods are provided for controlling an insect pest population, comprising contacting the insect pest population with an insecticidally-effective amount of a recombinant pesticidal protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID NOs: 51-819 or a variant thereof. As used herein, by “controlling a pest population” or “controls a pest” is intended any effect on a pest that results in limiting the damage that the pest causes. Controlling a pest includes, but is not limited to, killing the pest, inhibiting development of the pest, altering fertility or growth of the pest in such a manner that the pest provides less damage to the plant, decreasing the number of offspring produced, producing less fit pests, producing pests more susceptible to predator attack or deterring the pests from eating the plant.

In some embodiments methods are provided for controlling an insect pest population resistant to a pesticidal protein, comprising contacting the insect pest population with an insecticidally-effective amount of a recombinant PHI-4 polypeptide. In some embodiments methods are provided for controlling an insect pest population resistant to a pesticidal protein, comprising contacting the insect pest population with an insecticidally-effective amount of a recombinant pesticidal protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID NOs: 51-819 or a variant thereof.

In some embodiments methods are provided for protecting a plant from an insect pest, comprising expressing in the plant or cell thereof a recombinant PHI-4 polypeptide. In some embodiments methods are provided for protecting a plant from an insect pest, comprising expressing in the plant or cell thereof a recombinant pesticidal protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID NOs: 51-819 or variants thereof.

Insect Resistance Management (IRM) Strategies

Expression of B. thuringiensis δ-endotoxins in transgenic corn plants has proven to be an effective means of controlling agriculturally important insect pests (Perlak, et al., 1990; 1993). However, insects have evolved that are resistant to B. thuringiensis δ-endotoxins expressed in transgenic plants. Such resistance, should it become widespread, would clearly limit the commercial value of germplasm containing genes encoding such B. thuringiensis δ-endotoxins.

One way to increasing the effectiveness of the transgenic insecticides against target pests and contemporaneously reducing the development of insecticide-resistant pests is to use provide non-transgenic (i.e., non-insecticidal protein) refuges (a section of non-insecticidal crops/corn) for use with transgenic crops producing a single insecticidal protein active against target pests. The United States Environmental Protection Agency (epa.gov/oppbppdl/biopesticides/pips/bt_corn_refuge_2006.htm, which can be accessed using the www prefix) publishes the requirements for use with transgenic crops producing a single Bt protein active against target pests. In addition, the National Corn Growers Association, on their website: (ncga.com/insect-resistance-management-fact-sheet-bt-corn, which can be accessed using the www prefix) also provides similar guidance regarding refuge requirements. Due to losses to insects within the refuge area, larger refuges may reduce overall yield.

Another way of increasing the effectiveness of the transgenic insecticides against target pests and contemporaneously reducing the development of insecticide-resistant pests would be to have a repository of insecticidal genes that are effective against groups of insect pests and which manifest their effects through different modes of action.

Expression in a plant of two or more insecticidal compositions toxic to the same insect species, each insecticide being expressed at efficacious levels would be another way to achieve control of the development of resistance. This is based on the principle that evolution of resistance against two separate modes of action is far more unlikely than only one. Roush for example, outlines two-toxin strategies, also called “pyramiding” or “stacking,” for management of insecticidal transgenic crops. (The Royal Society. Phil. Trans. R. Soc. Lond. B. (1998) 353:777-1786). Stacking or pyramiding of two different proteins each effective against the target pests and with little or no cross-resistance can allow for use of a smaller refuge. The U.S. Environmental Protection Agency requires significantly less (generally 5%) structured refuge of non-Bt corn be planted than for single trait products (generally 20%). There are various ways of providing the IRM effects of a refuge, including various geometric planting patterns in the fields and in-bag seed mixtures, as discussed further by Roush.

In some embodiments the PHI-4 polypeptides of the disclosure are useful as an insect resistance management strategy in combination (i.e., pyramided) with other pesticidal proteins include but are not limited to Bt toxins, Xenorhabdus sp. or Photorhabdus sp. insecticidal proteins, and the like.

Provided are methods of controlling Lepidoptera and/or Hemiptera insect infestation(s) in a transgenic plant that promote insect resistance management, comprising expressing in the plant at least two different insecticidal proteins having different modes of action.

In some embodiments the methods of controlling Lepidoptera and/or Hemiptera insect infestation in a transgenic plant and promoting insect resistance management the at least one of the insecticidal proteins comprise a PHI-4 polypeptide insecticidal to insects in the order Coleoptera.

In some embodiments the methods of controlling Lepidoptera and/or Hemiptera insect infestation in a transgenic plant and promoting insect resistance management the at least one of the insecticidal proteins comprises a protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 51-819 or variants thereof, insecticidal to insects in the order Coleoptera.

In some embodiments the methods of controlling Lepidoptera and/or Hemiptera insect infestation in a transgenic plant and promoting insect resistance management comprise expressing in the transgenic plant a PHI-4 polypeptide and a cry protein insecticidal to insects in the order Coleoptera having different modes of action.

In some embodiments the methods of controlling Coleoptera insect infestation in a transgenic plant and promoting insect resistance management comprise in the transgenic plant a protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 51-819 or variants thereof and a cry protein insecticidal to insects in the order Lepidoptera and/or Hemiptera having different modes of action.

Also provided are methods of reducing likelihood of emergence of Lepidoptera and/or Hemiptera insect resistance to transgenic plants expressing in the plants insecticidal proteins to control the insect species, comprising expression of a PHI-4 polypeptide insecticidal to the insect species in combination with a second insecticidal protein to the insect species having different modes of action.

Also provided are methods of reducing likelihood of emergence of Coleoptera insect resistance to transgenic plants expressing in the plants insecticidal proteins to control the insect species, comprising expression of a protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 51-819 or variants thereof, insecticidal to the insect species in combination with a second insecticidal protein to the insect species having different modes of action.

Also provided are means for effective Coleoptera insect resistance management of transgenic plants, comprising co-expressing at high levels in the plants two or more insecticidal proteins toxic to Lepidoptera and/or Hemiptera insects but each exhibiting a different mode of effectuating its inhibiting growth or killing activity, wherein the two or more insecticidal proteins comprise a PHI-4 polypeptide and a cry protein. Also provided are means for effective Lepidoptera and/or Hemiptera insect resistance management of transgenic plants, comprising co-expressing at high levels in the plants two or more insecticidal proteins toxic to Lepidoptera and/or Hemiptera insects but each exhibiting a different mode of effectuating its inhibiting growth or activity, wherein the two or more insecticidal proteins comprise a protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 51-819 or variants thereof and a cry protein.

In addition, methods are provided for obtaining regulatory approval for planting or commercialization of plants expressing proteins insecticidal to insects in the order Coleoptera, comprising the step of referring to, submitting or relying on insect assay binding data showing that the PHI-4 polypeptide does not compete with binding sites for cry proteins in such insects. In addition, methods are provided for obtaining regulatory approval for planting or commercialization of plants expressing proteins insecticidal to insects in the order Coleoptera, comprising the step of referring to, submitting or relying on insect assay binding data showing that the protein of SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 51-819 or variant thereof does not compete with binding sites for cry proteins in such insects.

Methods for Increasing Plant Yield

Methods for increasing plant yield are provided. The methods comprise providing a plant or plant cell expressing a polynucleotide encoding the pesticidal polypeptide sequence disclosed herein and growing the plant or a seed thereof in a field infested with a pest against which the polypeptide has pesticidal activity. In some embodiments, the polypeptide has pesticidal activity against a lepidopteran, coleopteran, dipteran, hemipteran or nematode pest, and the field is infested with a lepidopteran, hemipteran, coleopteran, dipteran or nematode pest.

As defined herein, the “yield” of the plant refers to the quality and/or quantity of biomass produced by the plant. By “biomass” is intended any measured plant product. An increase in biomass production is any improvement in the yield of the measured plant product. Increasing plant yield has several commercial applications. For example, increasing plant leaf biomass may increase the yield of leafy vegetables for human or animal consumption. Additionally, increasing leaf biomass can be used to increase production of plant-derived pharmaceutical or industrial products. An increase in yield can comprise any statistically significant increase including, but not limited to, at least a 1% increase, at least a 3% increase, at least a 5% increase, at least a 10% increase, at least a 20% increase, at least a 30%, at least a 50%, at least a 70%, at least a 100% or a greater increase in yield compared to a plant not expressing the pesticidal sequence.

In specific methods, plant yield is increased as a result of improved pest resistance of a plant expressing a PHI-4 polypeptide disclosed herein. Expression of the PHI-4 polypeptide results in a reduced ability of a pest to infest or feed on the plant, thus improving plant yield.

These and other changes may be made to the invention in light of the above detailed description. In general, in the following claims, the terms used should not be construed to limit the invention to the specific embodiments disclosed in the specification and the claims.

The entire disclosure of each document cited (including patents, patent applications, journal articles, abstracts, manuals, books, or other disclosures) in the Background of the Invention, Detailed Description, and Examples is herein incorporated by reference in their entireties.

The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the subject invention, and are not intended to limit the scope of what is regarded as the invention. Efforts have been made to ensure accuracy with respect to the numbers used (e.g. amounts, temperature, concentrations, etc.) but some experimental errors and deviations should be allowed for. Unless otherwise indicated, parts are parts by weight, molecular weight is average molecular weight; temperature is in degrees centigrade; and pressure is at or near atmospheric.

EXPERIMENTALS Example 1: Generating PHI-4 Genes

Polynucleotides having single codon substitutions compared to the PHI-4 polypeptide of SEQ ID NO: 1 were generated. As described in the examples below, the corresponding PHI-4 polypeptides were expressed, purified and assayed for WCRW insecticidal activity in order to assess the corresponding activity diversity. A reverse mutagenesis primer and a complementary forward mutagenesis primer were designed to create the desired amino acid substitution(s) at the site(s) of interest. Typically the mutagenesis primer was between 30 to 45 bases in length with two or more bases, usually 10 to 15, on both sides of the site of interest. In the case of saturation mutagenesis, degenerated primers that cover all possible amino acid residues were used. Unless otherwise noted, the mutagenic reactions were carried out using Agilent's QuikChange™ Lightening Site-Directed Mutagenesis kit. Materials provided in the kit are QuikChange™ Lightening Enzyme, 10× QuikChange™ Lightning Buffer, dNTP mix, QuikSolution reagent and Dpn1 restriction enzyme according to the manufactures directions.

PCR amplifications were typically carried out with Expand™ High Fidelity PCR system (Roche, Switzerland) in 50 ul containing 50-100 ng templates, 0.4-2 μM primer pair, 200 μM dNTPs and 2 Units of DNA polymerase. The mutagenesis reaction was initiated by pre-heating the reaction mixture to 94° C. for 3 min, followed by 16 cycles of the following cycling program: 94° C. for 1 min, 52° C. for 1 min and 68° C. for 8, 12, 16 or 24 min according to the length of template. The mutagenesis reaction was completed by incubation at 68° C. for 1 h. The PCR-amplification products were evaluated by agarose gel electrophoresis. The PCR products were purified by QIAquick™ PCR purification kit (Qiagen, Germany) and further treated with the restriction enzyme Dpn1. An aliquot of 1 μl of this PCR product was typically transformed into BL21(DE3) cells and inoculated on Luria-Bertani (LB) plate containing 100 pg/ml ampicillin. About 10 colonies in the case of single amino acid mutation or 48 or more colonies for saturation mutagenesis were selected and plasmid DNA was isolated for sequencing. Two step sequencing was used, first for specific mutation site(s) with one sequencing primer followed by full length sequence confirmation with multiple sequencing primers.

Example 2: Purification of MBP::PHI-4 Fusion Polypeptides

Polynucleotides encoding PHI-4 polypeptides were expressed in a modified pMAL vector (New England Bio Lab) as a fusion (i.e. MBP::PHI-4; SEQ ID NO: 6) with MBP (maltose binding protein). The pMAL vector was modified to attach a 6×His tag to the N-terminus of MBP. In order to clone the polynucleotide encoding the MBP::PHI-4 fusion protein (SEQ ID NO: 6), Sph1 and BamH1 sites were engineered in the vector at the cloning site. The polynucleotide (SEQ ID NO: 1) encoding the PHI-4 polypeptide (SEQ ID NO: 2) was amplified with a forward primer (SEQ ID NO: 32) overlapping the Sph1 site and a reverse primer (SEQ ID NO: 33) overlapping the BamH1 site. This PCR product was digested with Sph1 and BamH1 and cloned into pMAL that was precut with the same enzymes. The forward primer was designed such that polynucleotides encoding both MBP and PHI-4 polypeptide (SEQ ID NO: 2) within the MBP::PHI-4 gene (SEQ ID NO:5) were ligated in frame. The plasmid containing the polynucleotide (SEQ ID NO: 5) encoding the MBP::PHI-4 polypeptide (SEQ ID NO: 6) was transformed into E. coli BL21(DE3) cells. The BL21(DE3) cells were grown in MagicMedia™ (Life Technologies) in either 96 deep well plates or flasks in a shaker running at 250 rpm at 37° C. for 8 hrs. followed by 16° C. for 48-60 hrs. During the 16° C. incubation, the MBP::PHI-4 polypeptide fusion protein accumulated in the BL21(DE3) cells as a soluble protein.

In order to purify the MBP::PHI-4 fusion protein (SEQ ID NO: 6), the E. coli cells were harvested by centrifugation and treated in a lysozyme solution consisting of 2 mg/ml lysozyme in 50 ml sodium phosphate buffer at pH 8.0 containing 300 mM NaCl, 2 U/ml endonuclease (Epicentre) and 5 mM MgCl₂ for 3 hrs. at 37° C. with gentle shaking. The lysozyme treated E. coli cells were then disrupted with 1% Triton X100 and clear lysate containing the MBP::PHI-4 proteins were prepared by centrifugation at 4000 rpm, 30 min (96 well plates) or 9000 rpm (flask produced samples). His tagged MBP-PHI-4 polypeptide fusion proteins were purified from the clear lysates by affinity chromatography using NiNTA agarose (catalog #30450; Qiagen) following the manufacturer's standard procedure. For high throughput protein purification, Pall 96 deep well filter plates (Pall Corporation; Catalogue #5051) were used for the affinity chromatography. The purified MPB::PHI-4 polypeptide fusion protein was eluted from NiNTA agarose and passed through Sephadex G25 to change the phosphate buffer to 25 mM HEPES-NaOH, pH 8 and used in insect bioassays for determining the insecticidal activity against Western Corn Rootworm (WCRW). Calipar GXII capillary electrophoresis with a protein chip (Agilent; catalogue #P/N760499) was used to determine the MPB::PHI-4 polypeptide concentrations. The protein analysis was repeated at least 3 times until the final concentration was within the predetermined deviation (less than 10%). Unless otherwise noted, the PHI-4 polypeptides disclosed herein were expressed, purified and assayed for WCRW insecticidal activity as maltose binding protein fusions (i.e. MBP::PHI-4; SEQ ID NO: 6) as described above.

Example 3: Determination of WCRW Insecticidal Activity of MBP::PHI-4 (SEQ ID NO: 6) and MBP::PHI-4-SFR12-004 (SEQ ID NO: 31) Polypeptides

The activity of MBP::PHI-4 (SEQ ID NO: 6) and MBP::PHI-4-SFR12-004 (SEQ ID NO: 31; Example 8) polypeptides against WCRW (western corn rootworm, Diabrotica virgifera virgifera) was determined in an artificial diet feeding assay essentially as described by Cong, R. et al. (Proceedings of the 4th Pacific Rim Conferences on Biotechnology of Bacillus thuringiensis and its environmental impact, pp. 118-123, ed. by R. J. Akhurst, C. E. Beard and P. Hughes, published in 2002, Canberra, Australia). The assays were conducted on an artificial diet containing dilutions of these polypeptides. The MBP::PHI-4-SFR12-004 polypeptide fusion (SEQ ID NO:31) and MBP::PHI-4 (SEQ ID NO: 6) fusions were prepared as above, and 10 μL of protein samples were mixed with 50 μL of molten (40-50° C.) artificial insect diet especially prepared for Diabrotica sp. with low temperature melting agarose, whey protein and wheat germ. The diet-PHI-4 polypeptide mixture was placed in each well of a 96 well micro-titer plate. Four or more neonate WCRW larvae were placed in each well to feed for 4 days at 25° C. and the response of insects towards the proteins was scored using a 0-3 numerical scoring system based on the size and mortality of the larvae in each well. If no response (or normal growth) was seen, a score of 0 was given. When the growth was slightly retarded, a score of 1 was given. A score of 2 meant that the larvae were severely retarded in growth (close to neonate size). A score of 3 meant death to all the larvae in the well. The percent response (% Response) for each treatment was calculated by dividing the total score, a sum of scores from replicated wells for each treatment, by the total highest possible scores and multiplying by 100 to yield “% Response”. For example, if one treatment (one sample, one dose) had 6 replicated wells, the total highest possible score would be 3×6=18. An observed set of scores of 3, 2, 2, 3, 2, 2 for six wells at a given dose for a given variant would result in (14/18)×100=78% Response.

Fast Activity Evaluation (FAE) Analysis:

The PHI-4 polypeptides at two concentrations were assayed along with 4 doses (two-fold dilutions bracketing the EC50) of the reference MBP::PHI-4 fusion protein (SEQ ID NO: 6) within one 96-well assay plate. The concentrations of the PHI-4 polypeptides were within the 4 doses of the reference protein concentrations, preferably around the middle point of the 4 dose concentrations. Each sample plate contained the reference MBP::PHI-4 protein (SEQ ID NO: 6) in a significant number of wells such as 16 wells in 4 separate doses. In each plate, up to 80 MBP::PHI-4 polypeptide variants were included and assayed for activity comparison with the reference PHI-4 polypeptide protein. From a sample plate, 10 ul of samples from each well were picked by multi-channel pipette and dispensed in one assay plate containing 50 ul molten diet in each well and mixed on a shaker. This process of producing the assay plate was repeated as many as 6 times or more to produce a desired number of replicate assay plates. After the diet was solidified and cooled to 4° C., neonate WCRW larvae were placed in each well, the plate was sealed with perforated Mylar film and incubated in a constant temperature incubator at 25° C. After 4 days, the insect responses were scored under a magnifying glass. The sigmoid dose-response values (Responses) were converted to linear probit dose-response values using SAS-JMP®, Generalized Linear Model, Binomial Response, Probit). The response for each protein in replicates was summed, this sum was compared with the probit dose—response line of the activity reference protein and the nominal fold improvement in potency was calculated. This nominal fold improvement estimated for a given dose in a given experiment is defined as the Fast Activity Evaluation Guide Number (FAEGN). For example, if a PHI-4 polypeptide showed a certain % response value at 40 ppm and comparison to the Probit curve indicated that the same response is predicted for the reference protein at 100 ppm, then the FAE Guide Number is 2.5 (100/40). According to this analysis, the PHI-4 polypeptide is nominally 2.5 times more potent than the reference PHI-4 polypeptide protein. The FAE assay was typically done with 2 different doses of PHI-4 polypeptides at a time and performed in three independent experiments, generating 6 FAEGN for each mutant in a typical FAE evaluation. The mean FAEGN is calculated to yield the Mean FAE Index (MFI). As used herein “Mean FAE Index” (MFI) refers to the mean of multiple FAEGN (typically 6 or more); unless otherwise indicated MFI is understood to be an arithmetic mean of FAEGN. For each protein, a two sided t-test was done comparing the multiple FAEGN from the clone with FAEGN values from the reference protein (typically 48-96 FAEGN of SEQ ID NO: 6). The two-sided t-test was done between these 48-96 FAEGN associated with the reference protein and the 6 or more FAEGN associated with the variant of interest. The Bonferroni correction was used to evaluate p-values (number of novel proteins/alpha) for the hypothesis that a given variant is significantly different in insecticidal activity compared to the reference protein, MBP::PHI-4 fusion protein (SEQ ID NO: 6) unless otherwise noted. The Bonferroni correction was used to evaluate p-values (number of novel proteins/alpha) for the hypothesis that a given variant is significantly different in insecticidal activity compared to the reference protein, MBP::PHI-4 fusion protein (SEQ ID NO: 6) unless otherwise noted.

Effective Concentration₅₀ (EC50) evaluation: Variants of particular interest were assayed at higher power in more extensive dose response curves to more accurately estimate the EC50s. The preparation of dose response, infestation, incubation and scoring were as described for the FAE assay format. EC50 determinations were typically carried out with a no insecticidal protein control plus seven two-fold dilutions bracketing the expected EC50 (typically 100 ppm for MBP::PHI-4; SEQ ID NO: 6) and 24 or more replicate wells within a given experiment. As used herein, the EC50 is defined as the predicted point with 50% response in the scoring scheme. It is a combination of growth or feeding Inhibition and lethal responses. In order to determine EC50 values, each treatment (one dose) was repeated 6 or more, usually 24, times.

Data from exemplary FAE and EC50 determinations are given in FIGS. 1 and 2 respectively. MBP::PHI-4-SFR12-004 (SEQ ID NO: 31; Example 8) is a variant with improved insecticidal activity. FIG. 1 shows the primary data for a typical FAE assay. Proteins were purified and quantified as described above. The % response in a typical FAE assay is given on the y axis. The protein concentration (toxin portion of the protein only) is given on the x axis. The geometric mean FAE Index in this instance is 4.0. FIG. 2 shows the data for a typical EC50 measurement. Proteins were purified and quantified as described above. The fractional response (multiply by 100 to get “% Response”) is given on the y axis. The inferred EC50s are 245 ppm (MBP::PHI-1 of SEQ ID NO: 6) and 48 ppm (SFR12-004; SEQ ID NO: 31). The data indicate that the insecticidal activity of MBP::PHI-4-SFR12-004 (SEQ ID NO: 31) is increased relative to that of MBP::PHI-4 (SEQ ID NO: 6).

The MBP::PHI-4 fusion is rapidly cleaved in the presence of insect gut fluid to yield MBP and mature PHI-4 protein and it is believed that the insecticidal activity is due to the cleaved toxin molecules. The MBP::PHI-4 fusion protein (SEQ ID NO:6) was digested with 1/100 (w/w) Factor Xa (New England Biolabs) at 25° C. overnight and the PHI-4 polypeptide was purified by Superdex 200 column chromatography utilizing the size difference and a weak affinity of MBP to Superdex. The MPB::PHI-4 fusion was also cleaved with trypsin and the mature PHI-4 polypeptide (derived from SEQ ID NO: 6) was purified. The mature, purified PHI-4 polypeptide derived from Factor Xa or trypsin cleavage has a MW of ˜60 kDa as measured on SDS-PAGE gels and is fully reactive to polyclonal antisera that also react with the MBP::PHI-4 fusion protein. The EC50 of this “mature” PHI-4 polypeptide fragment is within experimental error of the EC50 of the MBP::PHI-4 parental protein (SEQ ID NO: 6, as calculated on the basis of ppm associated with the toxin fragment and excluding ppm associated with the MBP domain of SEQ ID NO: 6.

Example 4: Single Amino Acid PHI-4 Polypeptide Variants (S1-S4)

A set of 209 PHI-4 polypeptide single amino acid substitution variants spread across both the N and C terminal portions of the protein were made and characterized (Megatable 1; MUT ID: 1-209). The mutagenesis template was the polynucleotide of SEQ ID NO: 5 encoding PHI-4 (SEQ ID NO: 6) as an MBP fusion. The mutations were made using the QuickChange kit (Agilent; Catalogue #200524) essentially as described in Example 1. The particular amino acid substitutions relative to PHI-4 (SEQ ID NO: 2) are as indicated in Megatable 1. For example in Megatable 1, the PHI-4 polypeptide identified as MUT ID: 1 has a valine substituted for the native amino acid alanine at position 202 of PHI-4 (SEQ ID NO: 2) and referred to “A202V”. The polypeptide variant for which activity is reported was prepared as an MBP fusion that is identical to SEQ ID NO: 6 except for this single amino acid substitution. In a similar manner, MUT ID: 1-872 (Megatable 1) are all made in the context of SEQ ID NO: 6; MUT ID: 873-910 (Megatable 1) are made in the context of SEQ ID NO: 8; MUT ID: 911-1135 (Megatable 1) are made in the context of SEQ ID NO: 10. All polypeptides of Megatable 1 were expressed and purified as MBP fusions as described in Example 2. The PHI-4 polypeptides were expressed as MBP fusions and purified as described in Example 2. The assay protocol for WCRW insecticidal activity of the PHI-4 polypeptides was essentially as described for FAE assays in Example 3 using SEQ ID NO: 6 as the reference protein. For the analysis of the data a “Mean Deviation Score” was calculated rather than a mean FAE Index. This is a related metric that is calculated as follows. Data from four two-fold dilutions of MBP::PHI-4 (typically about 25, 50, 100 and 200 ppm final concentration of PHI-4 fragment in artificial insect diet) is plotted on a Probit plot. The difference between response expected for MBP::PHI-4 (SEQ ID NO: 6) at a given dose, based on the Probit curve, and that observed for the mean score for a PHI-4 polypeptide variant at that given dose is calculated and this difference is defined as the Deviation Score. A negative Deviation Score indicates that the response is lower than that which is expected for the relevant parental backbone (typically MBP::PHI-4; SEQ ID NO: 6) at the same concentration and indicates that the variant is nominally less active than the parental backbone. A positive Deviation Score indicates that the variant is nominally more potent than the parental backbone. As used herein, the Mean Deviation Score refers to the arithmetic mean of multiple Deviation Scores which are typically derived from independent experiments. The Mean Deviation Score is used to estimate rank order of activities associated with a set of variants within a given experiment. The Mean Deviation Scores for the 209 variants of this example are given in Megatable 1 and is typically an average of at least three independent Deviation Score measurements.

Example 5: Single Amino Acid Substitution Mutants #2 (SFR)

A BLAST search revealed that AXMI-205 is a bacterial perforin-like protein. Perforin proteins from Clavibactor michiganensis (GenBank Accession number: YP_001223127; SEQ ID NO: 49), Laccaria bicolor (GenBank Accession number: XP_001885969; SEQ ID NO: 48), Marinomonas sp. (GenBank Accession number: ZP_01077945; SEQ ID NO: 38), Nematostella vectensis (GenBank Accession number: XP_001617993; SEQ ID NO: 50), Photorhabdus luminescens (GenBank Accession number: NP_928713; SEQ ID NO: 821), and Serratia proteamaculans (GenBank Accession number: YP_00147861; SEQ ID NO: 820) were found to be homologous to AXMI-205. The N-terminal region of up to 311 amino acid of AXMI-205 (SEQ ID NO: 35) is highly homologous to those perforin proteins. Among those perforin proteins, the 3D X-ray structure of Photorhabdus perforin-like protein has been published (Science 317, 1548-1551, 2007; PDB ID: 2QP2; SEQ ID NO: 822). According to the leading theory of perforin mode of action, the protein inserts 5 alpha helices of its N-terminal region into the target host membrane to form a large size pore (Proc. Natl. Acad. Sci. USA 102, 600-605, 2005; Immunology Today, 16, 194-201, 1995). In the Photorhabdus 2QP2 structure, there are two loops between Alpha C and D and Alpha I and J are considered to be the site for initiating the membrane insertion of those 5 alpha helices. These loops are called membrane insertion initiation loops. The primary sequences of those perforin proteins listed below were aligned using Vector NTI Align X function in order to identify the membrane insertion loop sequences of AXMI-205 (SEQ ID NO: 35). Amino acid sequences of V92-A103 and G211-E220 of SEQ ID NO: 6 were aligned with the membrane insertion initiation loops identified in the 2QP2 structure. A number of acidic, basic and other hydrophilic amino acids were found in the AXMI-205 (SEQ ID NO: 35) membrane insertion loops indicating these loops are exposed to the solvent.

Based on this homology model and prior mutation-activity relationship data, another 664 single amino acid substitution PHI-4 polypeptide variants were made in order to assess the sequence-insecticidal activity-relationships at the selected positions. This set of 664 point mutations (Megatable 1; MUT ID: 210-872) were made with MBP::PHI-1 polypeptide of SEQ ID NO 5 as the DNA template. Mutations were made according to the method of Example 1. The mutagenesis oligonucleotides used to create an exemplary mutant (PHI-4-R97D; SEQ ID NO: 7) are SEQ ID NO: 13 and SEQ ID NO: 14. The other PHI-4 polypeptides (Megatable 1; MUT ID: 210-872) were made in the same manner using mutagenesis oligonucleotides designed to create the selected substitutions at the desired residues of the protein. The resulting PHI-4 polypeptides were purified as MBP fusions and the activity measured in the FAE assay format or EC50 assay format as described in Examples 2 & 3. The Mean FAE Indices associated with these 664 PHI-4 polypeptides are given in lines 210-872 of Megatable 1.

Example 6: SFR16 Point Mutants

Another set of 38 single amino acid substitution PHI-4 polypeptide variants (Megatable 1; MUT ID: 873-910) were made with MBP::PHI-4-R97D (SEQ ID NO 7; Example 5) as the backbone. The substitutions were selected based on the sequence-activity relationships inferred from the PHI-4 polypeptide variants of the preceding Examples. Mutations were made according to the method of Example 1. Variants were purified as MBP fusions and activity measured in FAE assay format as described in Examples 2 & 3. The Mean FAE Indices associated with these mutants are given in lines 873-910 of Megatable 1. The reference for the Mean FAE Index in this example is SEQ ID NO: 8.

Example 7: PSR3 Mutants

The C-terminal domain of AXMI-205 shares significant homology with proteins that have β-prism 3D structural folding patterns. In the typical β-prism, the oligosaccharide binding site is almost always in a cavity formed between the apex and belt-loops from the same Greek-key motif. The primary amino-acid sequence motif has been identified as G-X₃-D (SEQ ID NO: 39) in banana lectin, which has two binding sites (Meagher J L et al Glycobiology 15:1033-42; 2005). The primary sequence of the PHI-4 polypeptide of SEQ ID NO: 2 contains 3 such motifs. The canonical structure of these motifs is indicated in FIG. 3. In addition, the G-X₃-D motif (SEQ ID NO: 39) can be extended toward the N-terminal direction into the D-X-G-[S/T]-G-X₃-D motif (SEQ ID NO: 40), which are present 1, 2 or 3 times in 24 proteins that are orthologous to the C-terminal portion of AXMI-205 (GenBank accession numbers: gi|136474758; gi|136444345; gi|136141087; gi|143658948; gi|142085802; gi|135275135; gi|138446054; gi|294814724; gi|170109524; gi1156316804; gi|156377786; gi|170109526; gi|77456557; gi|1209377; gi|302823768; gi1302532087; gi|256764986; gi|302787479; gi|302823738; gi|169762636; gi|302766657; gi1270056485; gi|302792467; gi|2384884450. Each of the 3 loops in the PHI-4 polypeptide of SEQ ID NO: 2 has potential to bind an oligosaccharide, a putative binding receptor present in WCRW mid-gut cell membrane surface.

As indicated in Megatable 1, 225 PHI-4 polypeptide variants (MUT IDs: 911-1135) were made to introduce an additional amino acid substitution into the PHI-4 polypeptide of PHI-4-D09 (SEQ ID NO: 10) using MBP::PHI-4-D09 (SEQ ID NO: 9) as the DNA template. The PHI-4-D09 backbone contains the following substitutions relative to SEQ ID NO: 2: L401, Y98F, L145V, L163V, 1172L, V3551, and P412A (numbers are relative to the PHI-4 polypeptide backbone of SEQ ID NO: 2. Mutagenesis was done by a modification of the method of Dominy et al (Methods in Molecular Biology, Vol. 235:209-223; 2003). Briefly, “NNK” mutagenesis at position 396 was done as follows. A pMAL vector encoding SEQ ID NO: 9 was amplified by inverse PCR for 20 cycles using SEQ ID NO: 15 & SEQ ID NO: 16. The PCR product was diluted 10-fold, subjected to one additional round of amplification using SEQ ID NO: 16 & SEQ ID NO: 17. The PCR product was purified on QuiaQuick™ column, phosphorylated with T4 polynucleotide kinase, circularized with T4 DNA ligase and transformed into E. coli BL21(DE3) cells. Candidate colonies were amplified by colony PCR and the PCR product was sequenced first with a single primer to confirm the presence of the desired mutation and subsequently sequenced fully with multiple primers to identify clones with no additional mutations. All other PHI-4 polypeptides of this example (Megatable 1, lines 911-1135) were made by a similar manner using mutagenesis oligonucleotides designed to create the selected substitutions at the desired residues of the protein. Positions with multiple desired mutations were made with degenerate forward primers whereas positions with only one desired mutation were made with non-degenerate primers. Clones with the desired sequences were used to express protein essentially as described in Example 2. Protein purification, activity measurements and statistical analysis was done essentially as described in example 3. The Mean FAE Index reflects the fold difference relative to PHI-4 polypeptide of SEQ ID NO: 6. The Mean EC50 of PHI-4-D09 (SEQ ID NO: 10) was measured at high statistical power and is 1.3-fold improved relative to MBP::PHI-4 (SEQ ID NO: 6). PHI-4 polypeptides of this example with Mean FAE Index>1.3× are deemed nominally improved relative to the parental backbone (MBP::PHI-4-D09; SEQ ID NO: 10) and diversity meeting this criterion was used for production of subsequent combinatorial mutants.

Example 8: Identification of Combinatorial Mutants of PHI-4 Polypeptides with Improved Insecticidal Activity as Measured in an Artificial Insect Diet Feeding Assay

Activity diversity identified in Examples 4-7 was used to create 192 combinatorial PHI-4 polypeptide variants (SEQ ID NO: 51-242; Megatable 2). The PHI-4 polypeptide variants were made by sequential point mutagenesis by the method of Example 1. In all cases, the indicated PHI-4 polypeptide variants were made as MBP fusions with the same linker as is indicated in SEQ ID NO 6. The MBP::PHI-4 polypeptides were expressed and purified as indicated in Example 2 & 3. Purified MBP::PHI-4 polypeptides were assayed in FAE assays to derive a Mean FAE Index or in EC50 assays as indicated in Example 3. Thirty-seven exemplary active PHI-4 polypeptide variants with increased Mean FAE Indices are given in Table 3, along with the sequence variation relative to SEQ ID NO: 6. The substitutions relative to PHI-4 polypeptide of SEQ ID NO: 2 are given in the right-most column. All proteins were expressed and purified as MBP fusions proteins. The reference protein for the Mean FAE Index is MPB::PHI-4 (SEQ ID NO: 6). The functional data on all of the PHI-4 polypeptides of this example is given in lines 51-242 of Megatable 2.

TABLE 3 SEQ Ex. ID Mutation List # NO: Alias FAE p value (vs SEQ ID NO: 2) 8 148 SFR11- 27.4 1.36E−13 R097D, K099L, E220D, K289L, 001 R293Q 8 225 SFR17- 20.9 0.001577 D042N, E046N, R097D, K099L, 013 E220D, K289L, R293Q, S333K, G336A, S401H, K402H 8 226 SFR17- 20.1 1.25E−07 D042N, E046N, R097D, K099L, 019 E220D, K289L, R293Q, S333K, G336A, V355I, S401H, K402H, P412A 8 227 SFR17- 19.5 1.48E−09 D042N, E046N, R097D, K099L, 014 E220D, K289L, S333K, G336A, S401H, K402H, P412A 8 228 SFR17- 19.0 5.42E−09 D042N, E046N, R097D, K099L, 011 E220D, K289L, S333K, G336A, S401H, K402H 8 229 SFR17- 17.1 6.82E−10 R097D, K099L, E220D, K289L, 005 R293Q, S401H, K402H, P412A 8 73 SFR10- 17.1 1.48E−19 R097D, S333K, G336A 032 8 230 SFR17- 16.5 7.08E−09 R097D, K099L, E220D, K289L, 018 R293Q, S333K, G336A, V355I, S401H, K402H, P412A 8 149 SFR11- 14.8 8.29E−15 R097D, K099L 012 8 231 SFR17- 13.5 7.19E−16 D042N, E046N, R097D, K099L, 009 E220D, K289L, R293Q, V355I, S401H, K402H, P412A 8 232 SFR17- 13.1 1.63E−10 D042N, E046N, R097D, K099L, 006 E220D, K289L, R293Q, S401H, K402H, P412A 8 233 SFR17- 12.9 5.18E−12 D042N, E046N, R097D, K099L, 016 E220D, K289L, R293Q, S333K, G336A, S401H, P412A 8 150 SFR11- 12.8 2.12E−10 R097D, K099L, E220D, K289L 005 8 74 SFR10- 12.0 7.73E−11 R097D, S401H 042 8 234 SFR17- 11.7 1.82E−07 R097D, K099L, E220D, K289L, 012 R293Q, S333K, G336A, S401H, K402H 8 235 SFR17- 11.3 7.41E−06 D042N, E046N, R097D, K099L, 004 E220D, K289L, S401H, K402H, P412A 8 236 SFR17- 10.5  2.1E−15 D042N, E046N, R097D, K099L, 017 E220D, K289L, S333K, G336A, V355I, S401H, K402H, P412A 8 151 SFR11- 9.9 2.08E−09 R097D, K289L 014 8 237 SFR17- 9.7   1E−04 D042N, E046N, R097D, K099L, 003 E220D, K289L, R293Q, S401H, K402H 8 183 SFR13- 9.3  1.5E−11 R097D, K099L, E220D, K289L, 035 V355I, A396T, P412A, K520E 8 75 SFR10- 8.9 0.003487 R097D, S333V, K520E, Q527K 72  8 238 SFR17- 8.7 0.010038 D042N, E046N, R097D, K099L, 001 E220D, K289L, S401H, K402H 8 76 SFR10- 8.4 2.98E−05 R097D, G462A, R464A, K465M 056 8 77 SFR10- 8.2 1.37E−05 R097D, S333K, G336A, E339N 036 8 239 SFR17- 8.2 4.75E−21 R097D, K099L, E220D, K289L, 015 R293Q, S333K, G336A, S401H, K402H, P412A 8 152 SFR11- 8.2 1.76E−07 R097D, R293Q 015 8 153 SFR11- 8.1  1.7E−06 R097D, E220D, K289L 010 8 240 SFR17- 7.6 5.14E−09 R097D, K099L, E220D, K289L, 002 R293Q, S401H, K402H 8 78 SFR10- 7.2 9.2E−12 R097D, S333V, G336A, S338V 039 8 79 SFR10- 6.7 8.17E−06 R097D, S401H, K402H, K520E, 82  Q527K 8 80 SFR10- 6.6 2.09E−14 R097D, S401H, K402H 045 8 81 SFR10- 6.4 2.15E−06 R097D, S401G, K402H, K520E, 87  Q527K 8 82 SFR10- 6.3 2.89E−08 R097D, G462A, R464K, K465M 060 8 51  SFR5- 6.2 1.65E−05 R097D, R293Q, R416E, K520E 014 8 184 SFR13- 6.1 1.84E−13 R097D, K099L, E220D, K289L, 018 V355I, S401G, P412A, K520E 8 154 SFR11- 5.5 1.54E−09 R097D, E220D 013 8 241 SFR17- 5.5 5.11E−22 D042N, E046N, R097D, K099L, 007 E220D, K289L, V355I, S401H, K402H, P412A

Example 9: Identification of Combinatorial Mutants of PHI-4 Polypeptides with Improved Insecticidal Activity as Measured in an Artificial WCRW Insect Diet Feeding Assay

Activity data from 315 PHI-4 polypeptide combinatorial variants (SEQ ID NOs 243-558) is provided in Megatable 2. Libraries were initially prepared by incorporation of diversity into SEQ ID NO: 5. The diversity was largely derived from that described in Example 4. Oligonucleotides encoding diversity at forty positions were incorporated into SEQ ID NO: 5 in a DNA shuffling reaction essentially as described (Ness et al., Nature Biotechnol. 20, 1251; 2002). Briefly, oligonucleotides typically of 30-45 bases in length encoding the diversity elements of interest were mixed at 0.02-2 micromolar each in the presence of an appropriate concentration of fragments of SEQ ID NO: 5. This reaction was assembled, rescued and cloned essentially as described for synthetic genes in Example 10 and as described (Ness et al., Nature Biotechnol. 20, 1251; 2002). Improved variants from initial libraries were subjected to family DNA shuffling essentially as described (A. Crameri, et al Nature 391, 288; 1998). This family shuffled library was screened by methods similar to those described in Example 3 (FAE) and to those described in Example 4 (Mean Deviation Score). Selected, improved PHI-4 polypeptide variants from the second round of DNA shuffling were further diversified by recombining N terminal and C terminal domains of elite clones using the method of splicing by overlap extension (R. Horton, et al., Gene 77:61-68; 1989) to yield novel variants. All variants were purified by the method of Example 2 and assayed by the method of Mean Deviations of Example 4. PHI-4 polypeptides variants identified are given in Megatable 2 (SEQ ID NO: 243-558).

Example 10: Identification of Combinatorial Mutants of PHI-4 Polypeptides with Improved Insecticidal Activity as Measured in an Artificial WCRW Insect Diet Feeding Assay

A set of 158 PHI-4 polypeptide combinatorial variants (SEQ ID NO: 559-716) were prepared by total gene synthesis, essentially as described by Stemmer et al (Gene 164:49-53; 1995). An additional treatment was implemented as described (Saaem, I. et al, Nucleic Acids Research, Published Nov. 29, 2011, 1-8). Briefly, in a typical gene synthesis reaction a set of oligonucleotides of 120 bases each encoding both top and bottom strands of the target gene were designed. Complementary oligos typically overlap by 54-65 nucleotides. Oligos to make synthetic genes are combined such that a final concentration of each oligo is approximately 0.05-1 uM. Gene assembly is typically done with Herculase II (Agilent) using the following cycling program: 98° C. 3 min followed by (96° C.×30 sec, 40° C.×30 sec, 72° C.×30 sec)×24 cycles. The initial PCR is then used as template for a second PCR in which a second pair of primers is used to amplify the fully synthetic gene product. Typical PCR conditions for the second synthetic gene reaction were 98° C. 3 min followed by (96° C. 30 sec, 50° C. 30 sec, 72° C. 35 sec), ×24 cycles. Reactions were analyzed by 1% E-gels (Invitrogen). A subsequent treatment consisting of a re-annealing step, treatment with Cel1 (Transgenomics; Catalogue #706020) and subsequent amplification (25 cycles) was done essentially as described (Saaem, I. et al, Nucleic Acids Research, Published Nov. 29, 2011, 1-8). A third and final amplification of the synthetic gene was done with similar PCR conditions in a single cycle. The product of this reaction was purified and ligated by Gibson ligation method (New England Biolabs; Catalogue ##E2611 L) to an appropriate vector transformed to chemical competent BL21(DE3) cells. Sequence verified clones (Lines 559-716 of Megatable 2) comprising the PHI-4 polypeptide were expressed as MBP fusions, purified and assayed essentially as described in Examples 2 and 3. Table 4 shows the SEQ ID NOs and substitutions relative to the PHI-4 polypeptide of SEQ ID NO: 2 for twenty active variants. The mean FAE Index is calculated relative to the MBP::PHI-4 backbone (SEQ ID NO: 6).

TABLE 4 SEQ Ex. ID MutationList # NO: Alias FAE p value (vs SEQ ID NO: 2) 10 559 PSR1- 25.6 3.26E−05 D042N, Y098F, L145V, L153I, 1-076 I172L, Y206F, I283V, V355I, G359A, W389L, I410V, A417S 10 560 PSR1- 22.7 4.19E−05 D042N, Y098F, I283V, V355I 1-074 10 561 PSR1- 15.9 7.67E−06 E046N, Y098F, L145V, Y171F, 2-145 I172L, D182Q, E278N, V355I, I410V, A417S, Q442E, V455I 10 562 PSR1- 11.6 1.46E−18 F043E, Y098F, L145V, Y171F, 2-082 I172L, Y206F, E278N, E339Q, V355I, V455I, W457N 10 563 PSR1- 11.0 3.91E−05 F043E, Y098F, L145V, Y171F, 2-088 I172L, Y206F, E278N, V355I, Q442E, V455I, W457N 10 564 PSR1- 10.1 1.07E−10 D042N, F043E, Y098F, L145V, 2-094 Y171F, I172L, E278N, M354L, V355I, V455I, W457N 10 565 PSR1- 9.7 2.8E−08 D042N, F043E, Y098F, L145V, 2-110 Y171F, I172L, E278N, V355I, I410V, Q442E, V455I 10 566 PSR1- 8.5 0.003166 D042N, Y098F, I283V, V355I, 1-073 A417S 10 567 PSR1- 7.2 1.03E−10 D042N, F043E, Y098F, L145V, 2-091 Y171F, I172L, V355I, Q442E, V455I, W457N 10 568 PSR1- 7.0 8.39E−08 F043E, Y098F, L145V, Y171F, 2-149 I172L, Y206F, E278N, V355I, A417S, V455I, W457N 10 569 PSR1- 6.9 3.93E−10 D042N, F043E, Y098F, L145V, 2-087 Y171F, I172L, E278N, V355I, Q442E, V455I, W457N 10 570 PSR1- 6.9 1.01E−07 D042N, R097N, Y098F, L145V, 2-158 Y171F, I172L, V355I, I410V, V455I 10 571 PSR1- 6.7 2.02E−05 E046N, Y098F, L145V, Y171F, 2-085 I172L, D182Q, E278N, V355I, Q442E, V455I, W457N 10 572 PSR1- 6.5 0.010726 D042N, I052V, Y098F, L145V, 1-053 I172L, Y206F, I283V, V355I, H370R, I410V, P412A, A417S, T426S, T461S 10 573 PSR1- 6.2 5.28E−07 D042N, F043E, Y098F, L145V, 2-096 Y171F, I172L, V210I, I283V, M354L, V355I, V455I, W457N 10 574 PSR1- 6.1 2.92E−11 D042N, R097N, Y098F, L145V, 2-135 Y171F, I172L, V355I, H370R, Q442E, V455I 10 575 PSR1- 5.9 0.018276 D042N, R097N, Y098F, L145V, 1-014 I172L, I283V, V355I, I410V, Q442E, V455I 10 575 PSR1- 5.8 5.02E−09 E046N, R097N, Y098F, L145V, 2-141 Y171F, I172L, V355I, Q442E, V455I 10 577 PSR1- 5.6 0.017252 E046N, Y098F, L145V, L163V, 1-006 I172L, Y206F, V210I, E339Q, V355I, A417S 10 578 PSR1- 5.3 7.35E−05 F043E, Y098F, L145V, Y171F, 2-095 I172L, Y206F, E278N, M354L, V355I, V455I, W457N

Example 11: Identification of Combinatorial Mutants of PHI-4 Polypeptides with Improved Insecticidal Activity as Measured in an Artificial Insect Diet Feeding Assay

Sixty-six PHI-4 polypeptide variants (SEQ ID NO: 717-783), containing permutations of a number of substitutions, were made by total gene synthesis, essentially as described in Example 10. The substitutions were done in the context of a backbone (PSR1-2-105; SEQ ID NO: 584 from Megatable 2) containing the following substitutions relative to PHI-4 polypeptide of SEQ ID NO: 2: E46N, R97N, Y98F, L145V, Y171F, 1172L, V355I, I410V, V455I, and W457N. The resulting MBP::PHI-4 polypeptide fusion proteins were expressed, purified, assayed for insecticidal activity on WCRW larvae and analyzed for insecticidal relative to MBP::PHI-4 (SEQ ID NO: 6) using the Mean FAE Index metric as described in Examples 2 & 3. Table 5 shows the Mean FAE Indices, SEQ ID NOs and substitutions relative to PHI-4 polypeptide of SEQ ID NO: 2 for twenty active PHI-4 polypeptide variants of this example. The Mean FAE Index was calculated relative to the MBP::PHI-4 backbone (SEQ ID NO: 6). The insecticidal activities of the PHI-4 polypeptides in Table 5 (Mean FAE Index) reflect the arithmetic means of three independent experiments and are expressed as fold WCRW insecticidal activity improvement of the PHI-4 polypeptide variants relative to MBP::PHI-4 (SEQ ID NO: 6). As indicated, the mean FAE Indices range from 0.26× to >8× (fold improvement relative to MBP::PHI-4). The majority of the PHI-4 polypeptides have increased insecticidal activity relative to MBP::PHI-4 (FAE>1). The p values indicate that the measured differences relative to MBP::PHI-4 (SEQ ID NO: 6) are highly significant.

TABLE 5 SEQ Ex. ID Mutation List # NO: Alias FAE (vs SEQ ID NO: 2) 11 717 PSR7- 10.0 E046N, R097N, Y098F, L145V, 141 Y171F, I172L, V355I, A396K, S401K, D403Y, I410V, V455I, W457N 11 718 PSR7- 8.4 E046N, R097N, Y098F, L145V, 63 Y171F, I172L, V355I, A396L, S401H, K402G, I410V, V455I, W457N 11 719 PSR7- 8.1 E046N, R097N, Y098F, L145V, 89 Y171F, I172L, V355I, A396L, S401H, D403Y, I410V, V455I, W457N 11 720 PSR7- 6.6 E046N, R097N, Y098F, L145V, 94 Y171F, I172L, V355I, A396G, S401K, I410V, V455I, W457N 11 721 PSR7- 6.1 E046N, R097N, Y098F, L145V, 106 Y171F, I172L, V355I, A396K, D403Y, I410V, V455I, W457N 11 722 PSR7- 6.1 E046N, R097N, Y098F, L145V, 96 Y171F, I172L, V355I, A396L, S401H, K402W, D403Y, I410V, V455I, W457N 11 723 PSR7- 5.0 E046N, R097N, Y098F, L145V, 100 Y171F, I172L, V355I, A396K, K402H, I410V, V455I, W457N 11 724 PSR7- 4.7 E046N, R097N, Y098F, L145V, 148 Y171F, I172L, V355I, A396G, S401K, K402G, D403Y, I410V, V455I, W457N 11 725 PSR7- 4.7 E046N, R097N, Y098F, L145V, 98 Y171F, I172L, V355I, A396K, S401H, K402W, D403Y, I410V, V455I, W457N 11 726 PSR7- 4.5 E046N, R097N, Y098F, L145V, 113 Y171F, I172L, V355I, A396K, S401K, I410V, V455I, W457N 11 727 PSR7- 4.5 E046N, R097N, Y098F, L145V, 121 Y171F, I172L, V355I, S401H, I410V, V455I, W457N 11 728 PSR7- 4.5 E046N, R097N, Y098F, L145V, 7 Y171F, I172L, V355I, A396K, S401H, K402H, I410V, V455I, W457N 11 729 PSR7- 4.2 E046N, R097N, Y098F, L145V, 86 Y171F, I172L, V355I, A396K, S401G, K402H, I410V, V455I, W457N 11 730 PSR7- 3.7 E046N, R097N, Y098F, L145V, 155 Y171F, I172L, V355I, S401H, K402G, D403Y, I410V, V455I, W457N 11 731 PSR7- 3.6 E046N, R097N, Y098F, L145V, 116 Y171F, I172L, V355I, S401K, K402W, D403Y, I410V, V455I, W457N 11 732 PSR7- 3.4 E046N, R097N, Y098F, L145V, 95 Y171F, I172L, V355I, S401G, K402H, D403Y, I410V, V455I, W457N 11 733 PSR7- 3.3 E046N, R097N, Y098F, L145V, 90 Y171F, I172L, V355I, A395L, K402G, I410V, V455I, W457N 11 734 PSR7- 3.3 E046N, R097N, Y098F, L145V, 97 Y171F, I172L, V355I, A395G, K402H, I410V, V455I, W457N 11 735 PSR7- 2.8 E046N, R097N, Y098F, L145V, 64 Y171F, I172L, V355I, A396G, S401H, K402W, D403Y, I410V, V455I, W457N 11 736 PSR7- 2.8 E046N, R097N, Y098F, L145V, 93 Y171F, I172L, V355I, S401H, K402W, D403Y, I410V, V455I, W457N

Example 12: Identification of Combinatorial Mutants of PHI-4 Polypeptides with Improved Insecticidal Activity as Measured in an Artificial WCRW Insect Diet Feeding Assay

Thirty six PHI-4 polypeptide variants (Megatable 2; SEQ ID NO: 784-819), containing permutations of eleven substitutions (E220D, G336A, K099L, K289L, K402H, K520E, P412A, R097D, S333K, S401H, V355I; numbering scheme as per SEQ ID NO:2; all variants made as MBP fusions as indicated in Megatable 2), were made using site directed mutagenesis as described in Example 1 and PH1-4 polypeptide variants were expressed, purified, assayed for insecticidal activity on WCRW larvae and analyzed for insecticidal activity relative to PHI-4 polypeptide of SEQ ID NO: 2 as described in Examples 2 & 3. Table 6 shows the Mean FAE Indices, SEQ ID NOs and substitutions relative to the PHI-4 polypeptide of SEQ ID NO: 2 for twenty active PHI-4 polypeptide variants of this example. The mean FAE Index is calculated relative to the MBP::PHI-4 backbone (SEQ ID NO: 6). The mean FAE indices reflect the arithmetic means of three independent experiments. As indicated, the mean FAE indices range from 4.5 to >8. The p values indicate that the measured differences relative to MBP::PHI-4 (SEQ ID NO: 6) are highly significant.

TABLE 6 SEQ Ex. ID MutationList # NO: Alias FAE (vs SEQ ID NO: 2) 12 784 SFR15- 18.8 R097D, K099L, E220D, 009 K289L, S333K, G336A, S401H, K402H, K520E 12 785 SFR15- 8.0 R097D, K099L, E220D, 019 K289L, S401H, K402H, P412A 12 786 SFR15- 7.9 R097D, K099L, E220D, 021 K289L, S333K, G336A, S401H, K402H 12 787 SFR15- 7.9 R097D, K099L, E220D, 033 K289L, S333K, V355I, S401H, K402H 12 788 SFR15- 7.3 R097D, K099L, E220D, 020 K289L, V355I, S401H, K402H, P412A 12 789 SFR15- 7.0 R097D, K099L, E220D, 027 K289L, S333K, V355I, S401H 12 790 SFR15- 6.6 R097D, K099L, E220D, 036 K289L, V355I, K520E 12 791 SFR15- 6.6 R097D, K099L, E220D, 007 K289L, S401H, K402H, P412A, K520E 12 792 SFR15- 5.5 R097D, K099L, E220D, 017 K289L, S401H, K402H 12 793 SFR15- 5.5 R097D, K099L, E220D, 015 K289L, S333K, G336A, P412A 12 794 SFR15- 5.4 R097D, K099L, E220D, 005 K289L, S401H, K402H, K520E 12 795 SFR15- 5.3 R097D, K099L, E220D, 001 K289L, S333K, G336A, K520E 12 796 SFR15- 5.2 R097D, K099L, E220D, 030 K289L, V355I 12 797 SFR15- 5.1 R097D, K099L, S333K, 025 G336A, S401H, K402H, K520E 12 798 SFR15- 5.0 R097D, K099L, E220D, 011 K289L, S333K, G336A, S401H, K402H, P412A, K520E 12 799 SFR15- 4.9 R097D, K099L, E220D, 012 K289L, S333K, G336A, V355I, S401H, K402H, P412A, K520E 12 800 SFR15- 4.7 R097D, K099L, E220D, 029 K289L, S333K, V355I, S401H, P412A 12 801 SFR15- 4.7 R097D, K099L, E220D, 016 K289L, S333K, G336A, V355I, P412A 12 802 SFR15- 4.5 R097D, K099L, E220D, 010 K289L, S333K, G336A, V355I, S401H, K402H, K520E 12 803 SFR15- 4.5 R097D, K099L, E220D, 003 K289L, S333K, G336A, P412A, K520E

Example 13: Accordance Between FAE and EC50 Assays

Experimental data on Mean FAE Index and mean EC50 for twenty-five PHI-4 polypeptide variants is given in FIG. 4. PHI-4 polypeptide variants were first tested in the FAE assay and then selected ones were retested in a multiple EC50 assays. In general, the fold improvement in mean FAE Index is modestly larger than the fold improvement that was subsequently measured in Mean EC50 measurements. This overall trend is as expected from the phenomenon of regression toward the mean (International Journal of Epidemiology 2005; 34:215-220). All 25 PHI-4 polypeptides elected for retesting in EC50 assays repeated as being significantly improved. FIG. 4 shows the EC50 measurements for representative variants from Megatable 2 (SEQ ID NO: 610, SEQ ID NO: 595, SEQ ID NO: 584, SEQ ID NO: 591, SEQ ID NO: 576, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 79, SEQ ID NO: 81, SEQ ID NO: 150, SEQ ID NO: 150, SEQ ID NO: 149, SEQ ID NO: 167, SEQ ID NO: 167, SEQ ID NO: 164, SEQ ID NO: 164, SEQ ID NO: 170, SEQ ID NO: 170, SEQ ID NO: 795, SEQ ID NO: 794, SEQ ID NO: 784, SEQ ID NO: 799, SEQ ID NO: 785, SEQ ID NO: 788, SEQ ID NO: 786, SEQ ID NO: 796, SEQ ID NO: 787).

Example 14: Combinatorial Substitutions

Example 11 yielded numerous PHI-4 polypeptide variants with improved insecticidal activity based on combinatorial substitutions of the three lectin-like motifs described in Example 7. A combinatorial library was prepared of 120 genes based on this diversity as follows. SEQ ID NOs 760 & 761 each contains unique substitutions in loop 1. SEQ ID NOs 717-726; 728-732; 734-737 & 760 contain unique substitutions in loop 2. SEQ ID NOs 761 and 758 contain unique substitutions in loop 3. Gene synthesis was used to create a combinatorial library of these loop sequences essentially as described in Example 10. These genes can be expressed and assayed for activity on WCRW larvae by the methods described above.

Example 15: Mutagenesis of Putative Protease Sensitive Sites of PHI-4 Polypeptides

Trypsin was used to identify the site(s) where proteases possibly attack (Protease Accessible Sites) the PHI-4 polypeptide of SEQ ID NO: 2. The PHI-4 polypeptide of SEQ ID NO: 2 in 50 mM Tris-HCl, pH8 was mixed with 1/50 (weight/weight) trypsin and incubated for 1 hr at 37° C. It was found that protein was relatively resistant to trypsin, with no immediate digestion down to the small fragments, but produced a 55 kDa major band and 24 kDa minor band by SDS-PAGE analysis after the incubation. These two bands were excised from the gel and analyzed by mass spectrometry and N-terminal sequencing. The N-terminal sequencing revealed SAANAGQLGN (amino acids 3-12 of SEQ ID NO: 2) for the 55 kDa protein indicating that only two amino acid residues, Methionine and Alanine were missing from the N-terminal sequence. The mass-spectrometry, however, showed a loss of C-terminal sequence from Ser at 521 to Leu at 536 of SEQ ID NO: 2. This indicates that trypsin digested the PHI-4 polypeptide protein at the C-terminal side of Lys at 520 of SEQ ID NO: 2. The N-terminal sequence of the 24 kDa band was VDKVLLMD (amino acids 314 to 321 of SEQ ID NO: 2). The mass-spectrometry analysis on the 24 kDa fragment confirmed the C-terminal region of PHI-4 polypeptide of SEQ ID NO: 2 starting with Val at 314 relative to SEQ ID NO: 2 as shown by N-terminal sequencing and ending at Lys at 520 relative to SEQ ID NO: 2. This indicated that trypsin digested the PHI-4 polypeptide of SEQ ID NO: 2 at the C-terminal side of Lys at 313 of SEQ ID NO: 2.

From this experiment, it was found that there are at least two protease accessible sites, Lys at 313 and Lys at 520 SEQ ID NO: 6. These two sites were mutated to other amino acid residues by saturation mutagenesis and it was found that mutations at Lys at 313 and Lys at 520 of SEQ ID NO: 2 increase the insecticidal activity significantly. For example, the activity of the PHI-4 polypeptide variant, K313Q (MUT ID: 889), was enhanced 2.3 fold over the activity of PHI-4 polypeptide of SEQ ID NO: 2 as measured in an FAE assay (Megatable 1). The activity of the PHI-4 polypeptide K520Q (MUT ID: 881) was increased 3.1 fold. Activity increases were also found in combinations with other mutations. For example, the activity of the PHI-4 polypeptide having the R097D and K520E substitutions (SEQ ID NO: 52) is 3.5 fold higher than that of PHI-4-R097D (MUT ID: 8) alone by FAE assay.

Example 16: Saturation Mutagenesis of Amino Acid Residues Selected by Site Directed Single Amino Acid Mutagenesis

Certain amino acid residues showed activity changes when mutated by site directed single amino acid mutagenesis. Those residues were “Selected” for saturation mutagenesis. For the purposes of this example, “Selected” can refer to single amino acid mutations that affect the activity, positively or negatively, relative to the parental backbone in which they were made. More specifically substitutions of Megatable 1 with Mean FAE Indices of <0.7 or of >1.3 are deemed “Selected”. For example, Selected amino acid residues were found by performing site directed mutagenesis at certain residues such as Arg and Lys. These basic amino acid residues were mutated to either acidic (Asp, Glu) or neutral, polar (e.g.: Asn and Gln) residues, and the activity of those mutants was determined by the FAE insect assay individually. Acidic amino acid residues such as Asp and Glu were changed to basic (e.g.: Arg, Lys) or neutral, polar (e.g.: Asn and Gln) residues and the mutant activity was determined. Neutral, polar amino acid residues such as Gln and Asn were mutated to either acidic (Asp, Glu) or basic (e.g.: Arg, Lys) amino acid residues to see if the activity of those mutants were changed positively (for example mean FAE Index>1.3 relative to the reference protein) or negatively (for example mean FAE Index <0.7 relative to the reference sequence). Another example of finding Selected amino acid residues is based on the sequence-function relationship. Since AXMI-205 is a member of the perforin family it is possible to identify amino acid residues of PHI-4 polypeptide of SEQ ID NO: 2 which are involved in the mode of action elements such as membrane insertion initiation and receptor binding loops. Amino acid residues found in those regions were considered Selected for saturation mutagenesis in this Example. One can use the alanine scanning to empirically define Selected residues. This technique was used to find additional Selected amino acid residues in the putative receptor binding loops.

After any amino acid residues were determined Selected, those residues were subjected to saturation or near saturation mutagenesis to produce a set of up to 19 mutants for each site (20 all possible amino acids minus the amino acid found in the wild type). The insecticidal activity of all these mutants was determined by the FAE insect assay. Saturation mutagenesis of the Selected amino acid residues, was useful for identifying substitutions with Mean FAE Indices of >1, in many cases >1.3. When the activity of one single amino acid mutation was found to be positive by showing increased activity over the PHI-4 polypeptide of SEQ ID NO: 2, the saturation mutagenesis enabled us to find other mutation(s) that showed further increased activity. For example, while the FAE Index of E082Q (MUT ID: 370) was positive (1.37), the saturation mutagenesis at this site revealed other mutations showing much higher FAE Indices. For example, the index of E0821 (MUT ID: 219) was 7.80 and that of E082L (MUT ID: 259) was 2.71 indicating that PHI-4 polypeptide of SEQ ID NO: 2 hydrophobic residues are beneficial at this site as far as its insecticidal activity is concerned.

Other Selected amino acid substitutions resulted in decreased activity. When these sites were examined further by saturation mutagenesis, substitutions with Mean FAE Indices of >1 were observed. For example, the FAE Indices of K099Q (MUT ID: 677), K099E (MUT ID: 715) were 0.34 and 0.26, respectively. This shows that Lysine at this site is functionally involved in activity and that alternative substitutions may result in improved activity. In this example, the saturation mutagenesis revealed substitutions with Mean FAE Index >1. For example, the substitution K099L (MUT ID: 299) has a Mean FAE Index of 5.72 (Megatable 1). Similar instances were found across the entire PHI-4 polypeptide of SEQ ID NO: 2, for example those indicated in Table 7. Table 7 shows the Mean FAE Indices for nine pairs of substitutions. All data is from Megatable 1.

TABLE 7 Substitution A Substitution B mean mean MUT Sub- FAE MUT Sub- FAE ID stitution Index ID stitution Index 570 K074Q 0.7 215 K074E 12.40 596 E203Q 0.59 288 E203T 2.18 800 R235Q 0.06 497 R235K 1.34 906 K313E 0.14 889 K313Q 2.29 838 D395Q 0.03 832 D395R 1.60 784 S398A 0.11 342 S398Q 1.51 629 K402Q 0.47 216 K402F 10.20 842 D403Q 0.03 251 D403Y 3.03 611 D447Q 0.54 211 D447K 31.70

The serine at position 98 of SEQ ID NO: 2 was Selected by alanine scanning amino acid residues found in a region of the protein that is suspected overlap with the receptor binding loops. Point mutants with improved potency may then be used to prepare and screen combinatorial libraries based on that diversity.

Taking mean FAE Index <0.7 as a definition of Selected, the following positions are deemed Selected: P14, D24, Q38, E53, R55, R61, Q75, D76, E83, E118, E126, D152, R166, K188, K191, D193, K242, P243, R248, D254, L266, D268, A270, D274, D298, K313, D315, K316, D321, V343, S349, Q360, R361, D368, I373, D376, F378, D379, D394, Y404, Q413, N430, Q449, D497, R500, S504. Saturation or near saturation mutagenesis at these positions can be performed by the method of Example 7 or equivalent methods and purified and screened the variant proteins for activity by the methods of Example 2 & 3 or equivalent methods.

Example 17: Transgenic Expression and Activity Evaluation

The PHI-4 polypeptides of SEQ ID NOs 22-25 were cloned under control of the maize ubiquitin promoter (Christensen and Quail, (1996) Transgenic Research 5:213-218) into a standard vector suitable for transformation of maize by Agrobacterium. Transgenic maize plants were produced by the method of Example 20. Selected T0 plants were tested for susceptibility to WCRW feeding by challenging T0 plants with WCRW larvae. After 19-21 days of challenge, the roots were visually examined and root nodal injury scores were recorded as described (Oleson J. et al J. Economic Entomology 98:1-8; 2005). Root nodal injury scores are indicated in FIG. 5. The data support the conclusion that the three PHI-4 polypeptide variants provide measurable in planta efficacy for protection of maize transgenic plants against WCRW. FIG. 5 shows the TO seedlings in the V3-V4 growth stage were challenged as described (Oleson J. et al J. Economic Entomology 98:1-8; 2005) and root nodal injury scores were recorded.

Example 18: In Planta Expression of Fusion Proteins

Localization of the protein can also play an important role in in planta accumulation. One can direct proteins such as PHI-4 polypeptides to the chloroplast using a chloroplast targeting peptide (CTP). Additionally, one can direct expression to the apoplastic space using fusions to peptides such as the barley alpha amylase-derived peptide (BAA; SEQ ID NO: 826). One may also direct transgenically expressed proteins for retention in the endoplasmic reticulum by fusing to both BAA and the sequence “KDEL” (SEQ ID NO: 828). Proteins can also be directed to the vacuolar space by fusion with the C terminal peptide from plant defensins such as the maize defensin 20 C-terminal propeptide (SEQ ID NO: 824). Other functionally equivalent gene elements may be combined in a similar manner. One may also direct expression specifically to the roots with root-specific promoters. Each of these modifications may be made separately or in combination and any given combination of elements to improve accumulation of protein in plant tissue or in functionally improved efficacy of the expressed protein.

Example 19: Transformation of Maize by Particle Bombardment and Regeneration of Transgenic Plants

Immature maize embryos from greenhouse donor plants are bombarded with a DNA molecule containing the PHI-4 polypeptide of nucleotide sequence (e.g., SEQ ID NO: 1) operably linked to an ubiquitin promoter and the selectable marker gene PAT (Wohlleben, et al., (1988) Gene 70: 25-37), which confers resistance to the herbicide Bialaphos. Alternatively, the selectable marker gene is provided on a separate DNA molecule. Transformation is performed as follows. Media recipes follow below.

Preparation of Target Tissue

The ears are husked and surface sterilized in 30% CLOROX™ bleach plus 0.5% Micro detergent for 20 minutes, and rinsed two times with sterile water. The immature embryos are excised and placed embryo axis side down (scutellum side up), 25 embryos per plate, on 560Y medium for 4 hours and then aligned within the 2.5 cm target zone in preparation for bombardment.

Preparation of DNA

A plasmid vector comprising a nucleotide sequence (e.g., SEQ ID NO: 1) operably linked to an ubiquitin promoter is made. For example, a suitable transformation vector comprises a UBI1 promoter from Zea mays, a 5′ UTR from UBI1 and a UBI1 intron, in combination with a PinlI terminator. The vector additionally contains a PAT selectable marker gene driven by a CAMV35S promoter and includes a CAMV35S terminator.

Optionally, the selectable marker can reside on a separate plasmid. A DNA molecule comprising a toxin nucleotide sequence as well as a PAT selectable marker is precipitated onto 1.1 μm (average diameter) tungsten pellets using a CaCl₂) precipitation procedure as follows:

-   -   100 μL prepared tungsten particles in water     -   10 μL (1 μg) DNA in Tris EDTA buffer (1 μg total DNA)     -   100 μL 2.5 M CaC1₂     -   10 μL 0.1 M spermidine

Each reagent is added sequentially to a tungsten particle suspension, while maintained on the multitube vortexer. The final mixture is sonicated briefly and allowed to incubate under constant vortexing for 10 minutes. After the precipitation period, the tubes are centrifuged briefly, liquid removed, washed with 500 mL 100% ethanol, and centrifuged for 30 seconds. Again the liquid is removed, and 105 μL 100% ethanol is added to the final tungsten particle pellet. For particle gun bombardment, the tungsten/DNA particles are briefly sonicated and 10 μL spotted onto the center of each macrocarrier and allowed to dry about 2 minutes before bombardment.

Particle Gun Treatment

The sample plates are bombarded at level #4 in particle gun #HE34-1 or #HE34-2. All samples receive a single shot at 650 PSI, with a total of ten aliquots taken from each tube of prepared particles/DNA.

Subsequent Treatment

Following bombardment, the embryos are kept on 560Y medium for 2 days, then transferred to 560R selection medium containing 3 mg/liter Bialaphos, and subcultured every 2 weeks. After approximately 10 weeks of selection, selection-resistant callus clones are transferred to 288J medium to initiate plant regeneration. Following somatic embryo maturation (2-4 weeks), well-developed somatic embryos are transferred to medium for germination and transferred to the lighted culture room. Approximately 7-10 days later, developing plantlets are transferred to 272V hormone-free medium in tubes for 7-10 days until plantlets are well established. Plants are then transferred to inserts in flats (equivalent to 2.5″ pot) containing potting soil and grown for 1 week in a growth chamber, subsequently grown an additional 1-2 weeks in the greenhouse, then transferred to classic 600 pots (1.6 gallon) and grown to maturity. Plants are monitored and scored for expression of the toxin by assays known in the art or as described above.

Bombardment and Culture Media

Bombardment medium (560Y) comprises 4.0 g/L N6 basal salts (SIGMA C-1416), 1.0 mL/L Eriksson's Vitamin Mix (1000×SIGMA-1511), 0.5 mg/L thiamine HCl, 120.0 g/L sucrose, 1.0 mg/L 2,4-D and 2.88 g/L L-proline (brought to volume with deionized H₂O following adjustment to pH 5.8 with KOH); 2.0 g/L Gelrite™ (added after bringing to volume with dl H₂O); and 8.5 mg/L silver nitrate (added after sterilizing the medium and cooling to room temperature). Selection medium (560R) comprises 4.0 g/L N6 basal salts (SIGMA C-1416), 1.0 mL/L Eriksson's Vitamin Mix (1000× SIGMA-1511), 0.5 mg/L thiamine HCl, 30.0 g/L sucrose, and 2.0 mg/L 2,4-D (brought to volume with dl H₂O following adjustment to pH 5.8 with KOH); 3.0 g/L Gelrite™ (added after bringing to volume with dl H₂O); and 0.85 mg/L silver nitrate and 3.0 mg/L Bialaphos (both added after sterilizing the medium and cooling to room temperature).

Plant regeneration medium (288J) comprises 4.3 g/L MS salts (GIBCO 11117-074), 5.0 mL/L MS vitamins stock solution (0.100 g nicotinic acid, 0.02 g/L thiamine HCl, 0.10 g/L pyridoxine HCl, and 0.40 g/L Glycine brought to volume with polished D-I H₂O) (Murashige and Skoog, (1962) Physiol. Plant. 15:473), 100 mg/L myo-inositol, 0.5 mg/L zeatin, 60 g/L sucrose, and 1.0 mL/L of 0.1 mM abscisic acid (brought to volume with polished dl H₂O after adjusting to pH 5.6); 3.0 g/L Gelrite™ (added after bringing to volume with dl H₂O); and 1.0 mg/L indoleacetic acid and 3.0 mg/L Bialaphos (added after sterilizing the medium and cooling to 60 C).

Hormone-free medium (272V) comprises 4.3 g/L MS salts (GIBCO 11117-074), 5.0 mL/L MS vitamins stock solution (0.100 g/L nicotinic acid, 0.02 g/L thiamine HCl, 0.10 g/L pyridoxine HCl, and 0.40 g/L Glycine brought to volume with polished dl H₂O), 0.1 g/L myo-inositol, and 40.0 g/L sucrose (brought to volume with polished dl H₂O after adjusting pH to 5.6); and 6 g/L Bacto-agar (added after bringing to volume with polished dl H₂O), sterilized and cooled to 60° C.

Example 20: Agrobacterium-Mediated Transformation of Maize and Regeneration of Transgenic Plants

For Agrobacterium-mediated transformation of maize with a toxin nucleotide sequence (e.g., SEQ ID NO: 1), the method of Zhao can be used (U.S. Pat. No. 5,981,840 and PCT Patent Publication Number WO 1998/32326; the contents of which are hereby incorporated by reference). Briefly, immature embryos are isolated from maize and the embryos contacted with a suspension of Agrobacterium under conditions whereby the bacteria are capable of transferring the nucleotide sequence (e.g. SEQ ID NO: 1) to at least one cell of at least one of the immature embryos (step 1: the infection step). In this step the immature embryos can be immersed in an Agrobacterium suspension for the initiation of inoculation. The embryos are co-cultured for a time with the Agrobacterium (step 2: the co-cultivation step). The immature embryos can be cultured on solid medium following the infection step. Following this co-cultivation period an optional “resting” step is contemplated. In this resting step, the embryos are incubated in the presence of at least one antibiotic known to inhibit the growth of Agrobacterium without the addition of a selective agent for plant transformants (step 3: resting step). The immature embryos can be cultured on solid medium with antibiotic, but without a selecting agent, for elimination of Agrobacterium and for a resting phase for the infected cells. Next, inoculated embryos are cultured on medium containing a selective agent and growing transformed callus is recovered (step 4: the selection step). The immature embryos are cultured on solid medium with a selective agent resulting in the selective growth of transformed cells. The callus is then regenerated into plants (step 5: the regeneration step), and calli grown on selective medium can be cultured on solid medium to regenerate the plants.

Megatable Legends

Megatable 1. The definitions of the column headings are as follows: “MUT ID”, a unique identifier for each substitutions; “Backbone”, the SEQ ID corresponding to the polypeptide backbone in which the substitution was made; “Position”, amino acid position according to the numbering convention of SEQ ID NO: 2, “Ref. A. A.”, the standard single letter code for the amino acid present in the backbone sequence at the indicated position; “Substitution”, the standard single letter code for the amino acid present in the mutant sequence at the indicated position; “FAE”, the arithmetic Mean FAE Index as further defined in Example 3; “p-value” the calculated p value associated with the hypothesis that the variant polypeptide is significantly different than the reference protein used in that particular FAE assay, as defined further in Example 3; “EC50 (ppm)”, EC50 as defined in example 3 with the EC50 dose given in ppm for the toxin portion of the sample; “Deviation”, Mean Deviation Score as defined in Example 4; “Example #”, the example number corresponding to the creation of the variant. The reference protein against which the variant protein is compared is: (MUT IDs: 1-872 and 911-1135) used SEQ ID NO: 6 as the reference protein; (MUT IDs: 873-910) used SEQ ID NO: 8 as the reference protein. Megatable 2. The definitions of the column headings are as follows: “SEQ ID NO:”, a unique identifier for each DNA or amino acid sequence; “Trivial Name”, a trivial but unique name for each DNA or protein sequence; “Reference Protein”, the SEQ ID NO corresponding to the reference protein used in the FAE assays for each variant; “FAE”, the arithmetic Mean FAE Index as further defined in Example 3; “p-value” the calculated p value associated with the hypothesis that the variant polypeptide is significantly different than the reference protein used in that particular FAE assay, as defined further in Example 3; “EC50 Fold”, the fold increase in potency as defined by ratio of reference EC50 to variant EC50 is calculated in each of multiple experiments and then the mean of these independently measured numbers is reported as “EC50 Fold”; “Deviation”, Mean Deviation Score as defined in Example 4; “Example #”, the example number corresponding to the creation of the variant. The reference protein against which the variant protein is compared is: (MUT IDs: 1-872 and 911-1135) used SEQ ID NO: 6 as the reference protein; (MUT IDs: 873-910) used SEQ ID NO: 8 as the reference protein. 

That which is claimed:
 1. A PHI-4 polypeptide having improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35).
 2. The PHI-4 polypeptide of claim 1, wherein the PHI-4 polypeptide comprises an amino acid sequence of the formula, (SEQ ID NO: 3)                 5                   10                  15 Met Xaa Ser Ala Ala Asn Ala Gly Xaa Leu Gly Asn Leu Xaa Gly                 20                  25                  30 Xaa Thr Ser Xaa Gly Met Xaa Tyr Xaa Val Asn Gly Leu Tyr Ala                 35                  40                  45 Ser Pro Glu Ser Leu Xaa Gly Gln Pro Leu Phe Xaa Xaa Gly Gly                 50                  55                  60 Xaa Leu Asp Ser Xaa Xaa Ile Glu Gly Xaa Xaa Xaa Xaa Phe Pro                 65                  70                  75 Xaa Ser Met His Val His Thr Tyr Phe His Ser Asp Xaa Xaa Gln                 80                  85                  90 Xaa Val Ser Xaa Xaa Ile Xaa Xaa Xaa Arg Xaa Xaa Xaa Ser Xaa                 95                  100                 105 His Val Gly Xaa Ser Gly Xaa Xaa Xaa Leu Phe Ser Xaa Ser Xaa                 110                 115                 120 Ser Val Asp Xaa Thr Thr Xaa Xaa Gln Gln Leu Xaa Glu Ile Thr                 125                 130                 135 Xaa Ser Ser Thr Arg Glu Xaa His Val Leu Trp Tyr Ile Ser Leu                 140                 145                 150 Pro Gly Ala Ala Thr Leu Xaa Ser Met Leu Xaa Xaa Xaa Xaa Xaa                 155                 160                 165 Xaa Asp Xaa Xaa Xaa Pro Asn Met Xaa Ala Met Xaa Leu Phe Xaa                 170                 175                 180 Xaa Xaa Gly Pro Xaa Xaa Xaa Xaa Xaa Ala Ala Val Gly Gly Arg                 185                 190                 195 Leu Xaa Xaa Xaa Xaa Ala Ser Lys Xaa Leu Xaa Met Xaa Ser Ser                 200                 205                 210 Xaa Ser Leu Ser Thr Thr Xaa Xaa Xaa Ser Xaa Xaa Ala Xaa Xaa                 215                 220                 225 Gly Glu Ile Xaa Ile Xaa His Gly Ser Xaa Met Glu Lys Gln Val                 230                 235                 240 Asn Ser Phe Xaa Xaa Xaa Ser Thr Ile Arg Xaa Thr Ala Thr Gly                 245                 250                 255 Gly Lys Pro Gly Xaa Thr Xaa Arg Ile Leu His Gly Pro Asp Ser                 260                 265                 270 Xaa Xaa Ala Phe Ser Xaa Trp Ala Xaa Ser Leu Leu Xaa Tyr Ala                 275                 280                 285 Thr Leu Met Asp Phe Xaa Thr Xaa Ser Leu Xaa Xaa Ile Xaa Ala                 290                 295                 300 Leu Xaa Asp Xaa Pro Xaa Xaa Xaa Xaa Glu Xaa Xaa Xaa Ala Xaa                 305                 310                 315 Pro Xaa Xaa Met Xaa Xaa Ser Gln Xaa Ser Ile Pro Xaa Val Asp                 320                 325                 330 Xaa Val Leu Leu Met Asp Ala Arg Pro Pro Met Val Xaa Ala Gly                 335                 340                 345 Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa                 350                 355                 360 Xaa Ser Thr Ser Xaa Xaa Tyr Lys Xaa Xaa Gly Gln Phe Xaa Gln                 365                 370                 375 Arg Xaa His Xaa Ser Val Ala Asp Gly His Xaa Pro Ile Xaa Xaa                 380                 385                 390 Asp Leu Phe Asp Xaa Gly Xaa Xaa Xaa Xaa Pro Val Gly Xaa Gln                 395                 400                 405 Xaa Val Trp Asp Xaa Xaa Xaa Xaa Gly Lys Xaa Xaa Xaa Tyr Xaa                 410                 415                 420 Cys Trp Arg Xaa Xaa Xaa Xaa Gln Gly Tyr Xaa Xaa Xaa Gly Asp                 425                 430                 435 Val Xaa Met Leu Ala Xaa Ser Gly Tyr Asn Pro Pro Asn Leu Pro                 440                 445                 450 Xaa Xaa Xaa Cys Xaa His Xaa Ser Leu Xaa Ala Xaa Xaa Xaa Thr                 455                 460                 465 Leu Xaa Xaa Xaa Xaa Trp Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa                 470                 475                 480 Xaa Val Ser Leu Trp Xaa Pro Gly Ala Ala Gly Ala Val Ala Ser                 485                 490                 495 Ser Cys Phe Ala Gly Val Pro Asn Tyr Asn Asn Pro Pro Asn Ser                 500                 505                 510 Gly Xaa Ile Xaa Xaa Leu Xaa Gly Ser Ile Ala Cys Val Xaa Thr                 515                 520                 525 Ser Ala Ile Ala Ser Met Xaa Xaa Met Xaa Ser Met Leu Ser Xaa                 530                 535 His Xaa Gly Met Glu Ala Met Met Ser Lys Leu,

wherein Xaa at position 2 is Ala or Arg; Xaa at position 9 is Gln, Lys or Glu; Xaa at position 14 is Pro or Ala; Xaa at position 16 is Val or Asp; Xaa at position 19 is Met or Leu; Xaa at position 22 is Gly or Ser; Xaa at position 24 is Asp, Asn or Gln; Xaa at position 36 is Leu or Met; Xaa at position 42 is Asp, Asn or Gln; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu, Asp, Asn or Gly; Xaa at position 50 is Ile or Val; Xaa at position 51 is Glu or Gln; Xaa at position 55 is Arg or Lys; Xaa at position 56 is Ser or Thr; Xaa at position 57 is Tyr or Phe; Xaa at position 58 is Thr or Ser; Xaa at position 61 is Arg, Lys or Glu; Xaa at position 73 is Phe or Tyr; Xaa at position 74 is Lys, Glu, Gly, Arg, Met, Leu, His or Asp; Xaa at position 76 is Asp or Gln; Xaa at position 79 is Lys or Glu; Xaa at position 80 is Glu or Ser; Xaa at position 82 is Glu, Ile, Leu, Tyr or Gln; Xaa at position 83 is Glu or Gln; Xaa at position 84 is Tyr or Phe; Xaa at position 86 is Glu or Gln; Xaa at position 87 is Lys or Gln; Xaa at position 88 is Met, Ile or Leu; Xaa at position 90 is Gln or Glu; Xaa at position 94 is Val or Ile; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln, Gly or Ser; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile, Met, Phe, Cys, Val or Asn; Xaa at position 103 is Ala or Gly; Xaa at position 105 is Leu or Ile; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp, Asn, Glu, Cys, Ala or Arg; Xaa at position 112 is Thr or Ser; Xaa at position 113 is Asp, Glu or Met; Xaa at position 117 is Thr or Ser; Xaa at position 121 is Tyr or Phe; Xaa at position 127 is Ala or Thr; Xaa at position 142 is Arg or Glu; Xaa at position 146 is Arg or Gln; Xaa at position 147 is Arg, Glu or Gln; Xaa at position 148 is Asp, Phe, Pro, Val, Glu, His, Trp, Ala, Arg, Leu, Ser, Gln or Gly; Xaa at position 149 is Phe or Val; Xaa at position 150 is Arg, Gln or Glu; Xaa at position 151 is Asp, Ser, Ala, Asn, Trp, Val, Gln, Cys, Met, Leu, Arg or Glu; Xaa at position 153 is Leu or Ile; Xaa at position 154 is Asn or Asp; Xaa at position 155 is Asn or Lys; Xaa at position 159 is Pro or Asp; Xaa at position 162 is Glu, Asp or Gln; Xaa at position 165 is Lys, Glu, Gln, Pro, Thr, Ala, Leu, Gly, Asp, Val, His, Ile, Met, Trp, Phe, Tyr or Arg; Xaa at position 166 is Arg or Gln; Xaa at position 167 is Tyr, Trp or Cys; Xaa at position 170 is Tyr or His; Xaa at position 171 is Tyr or Phe; Xaa at position 172 is Ile, Leu or Val; Xaa at position 173 is Ser or Ala; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 183 is Tyr or Val; Xaa at position 184 is Ser or Thr; Xaa at position 185 is Ala or Ser; Xaa at position 189 is Thr, Lys or Ile; Xaa at position 191 is Lys or Gln; Xaa at position 193 is Asp or Asn; Xaa at position 196 is Gln, Lys, Asn, Asp, Glu, Ala, Ile or Arg; Xaa at position 202 is Ala or Val; Xaa at position 203 is Glu, Thr or His; Xaa at position 204 is Met or Ala; Xaa at position 206 is Tyr or Phe; Xaa at position 207 is Lys or Gln; Xaa at position 209 is Leu or Pro; Xaa at position 210 is Val or Ile; Xaa at position 214 is Lys, Ser or Gln; Xaa at position 216 is Glu, Gln, Phe, Val, Tyr or Arg; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser, Gln, Arg, Trp, Met, Ala, Phe, Ile, Leu, Cys or Asn; Xaa at position 229 is Arg or Glu; Xaa at position 230 is Ser or Glu; Xaa at position 231 is Asn or Ser; Xaa at position 236 is Leu or Pro; Xaa at position 245 is Met or Leu; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln, Lys or Glu; Xaa at position 257 is Gln, Ile, Glu, Cys, Ser, His, Trp or Met; Xaa at position 261 is Gln, Glu or Lys; Xaa at position 264 is Glu or Gln; Xaa at position 268 is Asp or Asn; Xaa at position 276 is Ser or Ala; Xaa at position 278 is Glu, Asn or Gln; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 282 is Pro or Gly; Xaa at position 284 is Trp or Arg; Xaa at position 287 is Ala or Cys; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr, Asp, Phe, Ser, Met, Arg, Trp, Ile, His, Asn, Cys, Gly or Ala; Xaa at position 291 is Glu or Gln; Xaa at position 292 is Arg or Gln; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 294 is Val or Ala; Xaa at position 296 is Leu or Ile; Xaa at position 297 is Glu or Gln; Xaa at position 298 is Asp or Gln; Xaa at position 300 is Phe or Tyr; Xaa at position 302 is Glu or Gln; Xaa at position 303 is Phe or Tyr; Xaa at position 305 is Lys or Gln; Xaa at position 306 is Gln or Lys; Xaa at position 309 is Gln, Lys or Glu; Xaa at position 313 is Lys, Gln or Arg; Xaa at position 316 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 331 is Glu, Asn or Gln; Xaa at position 333 is Ser, Arg, Gly, Lys, Val, Asn, Ala, His, Gln, Thr, Asp, Ile, Leu, Cys or Glu; Xaa at position 334 is Gly, Arg, Lys, Ile or Trp; Xaa at position 335 is Ser or Ala; Xaa at position 336 is Gly or Ala; Xaa at position 337 is Ala, Val or Gly; Xaa at position 338 is Ser, His, Val, Lys, Ala, Gly, Thr, Ile, Glu, Met, Arg, Pro, Asp, Asn or Leu; Xaa at position 339 is Glu, Asn, Gln, Ile, Pro, Met, Ser, Ala, Cys, Phe, Val, Leu, Asp, Trp, His or Arg; Xaa at position 341 is Leu or Val; Xaa at position 342 is Ala, Ser or Val; Xaa at position 343 is Val or Ile; Xaa at position 344 is Phe or Trp; Xaa at position 345 is Asn or His; Xaa at position 346 is Pro or Ala; Xaa at position 350 is Asn or Ser; Xaa at position 351 is Gly or Val; Xaa at position 354 is Met or Leu; Xaa at position 355 is Val, Ile or Leu; Xaa at position 359 is Gly or Ala; Xaa at position 362 is Asn or Ser; Xaa at position 364 is Ala or Ser; Xaa at position 371 is Ala, Gly or Thr; Xaa at position 374 is Phe or Ile; Xaa at position 375 is Lys or Arg; Xaa at position 380 is Leu or Gly; Xaa at position 382 is Val, Asp or Leu; Xaa at position 383 is Leu, Ile or Val; Xaa at position 384 is Lys, Ala or Gly; Xaa at position 385 is Ala or Gly; Xaa at position 389 is Trp or Tyr; Xaa at position 391 is Arg, Leu, Glu, Gln or Asp; Xaa at position 395 is Asp or Cys; Xaa at position 396 is Ala, Leu, Lys, Asn, Gly, Ile, Met, Arg, Tyr, Gln or His; Xaa at position 397 is Gly, Arg or Ala; Xaa at position 398 is Ser, Gln or Cys; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val, Arg, Ile, Asn, Phe, Thr, Ala, Asp, Met, Gln or Glu; Xaa at position 402 is Lys, Phe, His, Arg, Trp, Gly, Asn, Leu, Tyr, Thr, Val, Met, Pro or Ala; Xaa at position 403 is Asp, Tyr, Trp, Phe or Glu; Xaa at position 405 is Ala or Ser; Xaa at position 409 is Ala or Pro; Xaa at position 410 is Ile or Val; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg, Glu or Gln; Xaa at position 417 is Ala, Ser or Cys; Xaa at position 418 is Leu or Met; Xaa at position 422 is Met or Val; Xaa at position 426 is Thr or Ser; Xaa at position 436 is Asp or Lys; Xaa at position 437 is Tyr or Val; Xaa at position 438 is Val or Arg; Xaa at position 440 is Val or Leu; Xaa at position 442 is Gln, Lys or Glu; Xaa at position 445 is Cys, Leu or Thr; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu, Ile, Gly, Pro, Leu, Phe, Trp or Thr; Xaa at position 448 is Val or Ala; Xaa at position 449 is Gln or Glu; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 453 is Asn or Asp; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp, Gln, Gly, His, Asp, Leu, Thr, Pro or Asn; Xaa at position 455 is Val or Ile; Xaa at position 457 is Trp or Asn; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile, Thr, Ser, His, Cys, Tyr, Pro, Asn, Ala, Arg or Glu; Xaa at position 460 is Gly or Ala; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala, Ser or Gly; Xaa at position 464 is Arg, Gly, His, Gln, Thr or Phe; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly, Arg, Thr, His, Cys, Trp, Phe or Leu; Xaa at position 466 is Asp or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 497 is Asp or Gln; Xaa at position 499 is Glu or Gln; Xaa at position 500 is Arg, Gln or Lys; Xaa at position 502 is Arg, Glu or Gln; Xaa at position 509 is Lys, Gln or Glu; Xaa at position 517 is Gln, Cys, Asn, Val or Pro; Xaa at position 518 is Glu or Gln; Xaa at position 520 is Lys, Gln or Glu; Xaa at position 525 is Gln or Lys; and Xaa at position 527 is Gln, Lys, Pro, Cys, Glu, Ser, His, Phe or Trp; wherein one or more amino acid(s) designated by Xaa in SEQ ID NO: 3 is an amino acid different from the corresponding amino acid of SEQ ID NO: 35 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO:
 35. 3. The PHI-4 polypeptide of claim 1, wherein the PHI-4 polypeptide comprises an amino acid sequence of the formula (SEQ ID NO: 4)                 5                   10                  15 Met Xaa Ser Ala Ala Asn Ala Gly Gln Leu Gly Asn Leu Pro Gly                 20                  25                  30 Val Thr Ser Met Gly Met Gly Tyr Xaa Val Asn Gly Leu Tyr Ala                 35                  40                  45 Ser Pro Glu Ser Leu Leu Gly Gln Pro Leu Phe Xaa Xaa Gly Gly                 50                  55                  60 Xaa Leu Asp Ser Ile Glu Ile Glu Gly Arg Ser Tyr Thr Phe Pro                 65                  70                  75 Arg Ser Met His Val His Thr Tyr Phe His Ser Asp Phe Xaa Gln                 80                  85                  90 Asp Val Ser Xaa Glu Ile Xaa Glu Tyr Arg Glu Lys Met Ser Gln                 95                  100                 105 His Val Gly Val Ser Gly Xaa Xaa Xaa Leu Phe Ser Ala Ser Leu                 110                 115                 120 Ser Val Asp Xaa Thr Thr Thr Asp Gln Gln Leu Thr Glu Ile Thr                 125                 130                 135 Tyr Ser Ser Thr Arg Glu Ala His Val Leu Trp Tyr Ile Ser Leu                 140                 145                 150 Pro Gly Ala Ala Thr Leu Arg Ser Met Leu Arg Xaa Xaa Phe Xaa                 155                 160                 165 Xaa Asp Xaa Asn Asn Pro Asn Met Pro Ala Met Xaa Leu Phe Xaa                 170                 175                 180 Xaa Tyr Gly Pro Tyr Xaa Ile Ser Xaa Ala Ala Val Gly Gly Arg                 185                 190                 195 Leu Xaa Tyr Ser Ala Ala Ser Lys Thr Leu Lys Met Asp Ser Ser                 200                 205                 210 Xaa Ser Leu Ser Thr Thr Ala Xaa Met Ser Xaa Lys Ala Leu Val                 215                 220                 225 Gly Glu Ile Lys Ile Xaa His Gly Ser Xaa Met Glu Lys Gln Val                 230                 235                 240 Asn Ser Phe Arg Ser Asn Ser Thr Ile Arg Leu Thr Ala Thr Gly                 245                 250                 255 Gly Lys Pro Gly Met Thr Xaa Arg Ile Leu His Gly Pro Asp Ser                 260                 265                 270 Xaa Xaa Ala Phe Ser Xaa Trp Ala Glu Ser Leu Leu Asp Tyr Ala                 275                 280                 285 Thr Leu Met Asp Phe Ser Thr Xaa Ser Leu Xaa Pro Ile Trp Ala                 290                 295                 300 Leu Ala Asp Xaa Pro Glu Arg Xaa Val Glu Leu Glu Asp Ala Phe                 305                 310                 315 Pro Glu Phe Met Lys Gln Ser Gln Gln Ser Ile Pro Xaa Val Asp                 320                 325                 330 Lys Val Leu Leu Met Asp Ala Arg Pro Pro Met Val Xaa Ala Gly                 335                 340                 345 Glu Asp Xaa Xaa Ser Xaa Ala Xaa Xaa Asp Leu Ala Xaa Phe Asn                 350                 355                 360 Xaa Ser Thr Ser Asn Gly Tyr Lys Met Xaa Gly Gln Phe Xaa Gln                 365                 370                 375 Arg Asn His Ala Ser Val Ala Asp Gly His Ala Pro Ile Phe Lys                 380                 385                 390 Asp Leu Phe Asp Leu Gly Val Leu Lys Ala Pro Val Gly Trp Gln                 395                 400                 405 Xaa Val Trp Asp Asp Xaa Gly Ser Gly Lys Xaa Xaa Xaa Tyr Ala                 410                 415                 420 Cys Trp Arg Ala Ile Xaa Xaa Gln Gly Tyr Xaa Xaa Xaa Gly Asp                 425                 430                 435 Val Met Met Leu Ala Xaa Ser Gly Tyr Asn Pro Pro Asn Leu Pro                 440                 445                 450 Asp Tyr Val Cys Xaa His Gln Ser Leu Cys Ala Xaa Val Gln Thr                 455                 460                 465 Leu Xaa Asn Xaa Xaa Trp Trp Asp Xaa Gly Xaa Xaa Xaa Xaa Xaa                 470                 475                 480 Asp Val Ser Leu Trp Xaa Pro Gly Ala Ala Gly Ala Val Ala Ser                 485                 490                 495 Ser Cys Phe Ala Gly Val Pro Asn Tyr Asn Asn Pro Pro Asn Ser                 500                 505                 510 Gly Asp Ile Glu Xaa Leu Arg Gly Ser Ile Ala Cys Val Xaa Thr                 515                 520                 525 Ser Ala Ile Ala Ser Met Gln Glu Met Xaa Ser Met Leu Ser Gln                 530                 535 His Xaa Gly Met Glu Ala Met Met Ser Lys Leu,

wherein Xaa at position 2 is Ala or Arg; Xaa at position 24 is Asp or Asn; Xaa at position 42 is Asp or Asn; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu or Asn; Xaa at position 74 is Lys, Glu or Gly; Xaa at position 79 is Lys or Glu; Xaa at position 82 is Glu, Ile, Leu or Tyr; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln or Gly; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile or Met; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp or Asn; Xaa at position 147 is Arg or Glu; Xaa at position 148 is Asp, Phe or Pro; Xaa at position 150 is Arg or Gln; Xaa at position 151 is Asp, Ser, Ala or Asn; Xaa at position 153 is Leu or Ile; Xaa at position 162 is Glu or Gln; Xaa at position 165 is Lys, Glu or Gln; Xaa at position 166 is Arg or Gln; Xaa at position 171 is Tyr or Phe; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 196 is Gln, Lys, Asn or Asp; Xaa at position 203 is Glu, Thr or His; Xaa at position 206 is Tyr or Phe; Xaa at position 216 is Glu or Gln; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser or Gln; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln or Lys; Xaa at position 257 is Gln or Ile; Xaa at position 261 is Gln or Glu; Xaa at position 278 is Glu or Asn; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr or Asp; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 313 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 333 is Ser, Gly, Lys, Val or Asn; Xaa at position 334 is Gly, Arg, Lys or Ile; Xaa at position 336 is Gly or Ala; Xaa at position 338 is Ser, His, Val, Lys or Ala; Xaa at position 339 is Glu, Asn, Ile or Pro; Xaa at position 343 is Val or Ile; Xaa at position 346 is Pro or Ala; Xaa at position 355 is Val or Ile; Xaa at position 359 is Gly or Ala; Xaa at position 391 is Arg, Glu or Gln; Xaa at position 396 is Ala, Leu, Lys, Asn or Gly; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val or Arg; Xaa at position 402 is Lys, Phe, His, Arg, Gly, Trp, Thr, Asn, Tyr or Met; Xaa at position 403 is Asp or Tyr; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg or Glu; Xaa at position 417 is Ala or Ser; Xaa at position 418 is Leu or Met; Xaa at position 426 is Thr or Ser; Xaa at position 440 is Val or Leu; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu or Ile; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp or Gln; Xaa at position 455 is Val or Ile; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile or Tyr; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala or Ser; Xaa at position 464 is Arg, Gly or His; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 500 is Arg or Gln; Xaa at position 509 is Lys or Gln; Xaa at position 520 is Lys, Gln or Glu; and Xaa at position 527 is Gln, Lys, Pro, Cys or Glu; wherein one or more amino acid(s) designated by Xaa in SEQ ID NO: 4 is an amino acid different from the corresponding amino acid of SEQ ID NO: 35 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO:
 35. 4. The PHI-4 polypeptide of claim 2 or 3, further comprising one or more amino acid substitutions at position 86, 359, 399, 464, 465, 466, 467, 468, 499 or 517 of SEQ ID NO: 3 or SEQ ID NO:
 4. 5. The PHI-4 polypeptide of claim 4, wherein the amino acid at position 86 is Glu or Thr; the amino acid at position 359 is Gly or Ala; the amino acid at position 399 is Gly or Ala; the amino acid at position 464 is Arg, Ala, Lys, Asp or Asn; the amino acid at position 465 is Lys or Met, the amino acid at position 467 is Val, Ala, Leu or Thr; the amino acid at position 468 is Ser or Leu; the amino acid at position 499 is Glu or Ala, or the amino acid at position 517 is Glu or Arg.
 6. The PHI-4 polypeptide of claim 2 or 3, further comprising one or more amino acid substitution, insertion of one or more amino acids, deletion of one or more amino acids, and combinations thereof.
 7. The PHI-4 polypeptide of claim 2 or 3, wherein the insecticidal activity is increased about 2 fold or greater compared to AXMI-205 (SEQ ID NO: 35).
 8. The PHI-4 polypeptide of claim 2 or 3, wherein the insecticidal activity is increased about 5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).
 9. The PHI-4 polypeptide of claim 2 or 3, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is against Western Corn Root Worm (WCRW) larvae.
 10. The PHI-4 polypeptide of claim 2 or 3, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantitated as a Mean FAE Index.
 11. The PHI-4 polypeptide of claim 2 or 3, wherein the improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35) is quantitated as an EC50 value.
 12. The PHI-4 polypeptide of claim 2 or 3, wherein the improved activity compared to AXMI-205 (SEQ ID NO: 35) is quantitated as a Mean Deviation Score.
 13. The PHI-4 polypeptide of claim 2 or 3, having 1 to 54 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or
 4. 14. The PHI-4 polypeptide of claim 2 or 3, having 1 to 15 amino acid substitutions at a position(s) designated as Xaa in SEQ ID NO: 3 or
 4. 15. The PHI-4 polypeptide of claim 2 or 3, wherein 1-20 amino acids are deleted from the C-terminus of the PHI-4 polypeptide.
 16. A polynucleotide encoding a PHI-4 polypeptide, wherein the PHI-4 polypeptide has improved insecticidal activity compared to AXMI-205 (SEQ ID NO: 35).
 17. The polynucleotide encoding a PHI-4 polypeptide of claim 16, wherein the PHI-4 polypeptide comprises an amino acid sequence of the formula, (SEQ ID NO: 3)                 5                   10                  15 Met Xaa Ser Ala Ala Asn Ala Gly Xaa Leu Gly Asn Leu Xaa Gly                 20                  25                  30 Xaa Thr Ser Xaa Gly Met Xaa Tyr Xaa Val Asn Gly Leu Tyr Ala                 35                  40                  45 Ser Pro Glu Ser Leu Xaa Gly Gln Pro Leu Phe Xaa Xaa Gly Gly                 50                  55                  60 Xaa Leu Asp Ser Xaa Xaa Ile Glu Gly Xaa Xaa Xaa Xaa Phe Pro                 65                  70                  75 Xaa Ser Met His Val His Thr Tyr Phe His Ser Asp Xaa Xaa Gln                 80                  85                  90 Xaa Val Ser Xaa Xaa Ile Xaa Xaa Xaa Arg Xaa Xaa Xaa Ser Xaa                 95                  100                 105 His Val Gly Xaa Ser Gly Xaa Xaa Xaa Leu Phe Ser Xaa Ser Xaa                 110                 115                 120 Ser Val Asp Xaa Thr Thr Xaa Xaa Gln Gln Leu Xaa Glu Ile Thr                 125                 130                 135 Xaa Ser Ser Thr Arg Glu Xaa His Val Leu Trp Tyr Ile Ser Leu                 140                 145                 150 Pro Gly Ala Ala Thr Leu Xaa Ser Met Leu Xaa Xaa Xaa Xaa Xaa                 155                 160                 165 Xaa Asp Xaa Xaa Xaa Pro Asn Met Xaa Ala Met Xaa Leu Phe Xaa                 170                 175                 180 Xaa Xaa Gly Pro Xaa Xaa Xaa Xaa Xaa Ala Ala Val Gly Gly Arg                 185                 190                 195 Leu Xaa Xaa Xaa Xaa Ala Ser Lys Xaa Leu Xaa Met Xaa Ser Ser                 200                 205                 210 Xaa Ser Leu Ser Thr Thr Xaa Xaa Xaa Ser Xaa Xaa Ala Xaa Xaa                 215                 220                 225 Gly Glu Ile Xaa Ile Xaa His Gly Ser Xaa Met Glu Lys Gln Val                 230                 235                 240 Asn Ser Phe Xaa Xaa Xaa Ser Thr Ile Arg Xaa Thr Ala Thr Gly                 245                 250                 255 Gly Lys Pro Gly Xaa Thr Xaa Arg Ile Leu His Gly Pro Asp Ser                 260                 265                 270 Xaa Xaa Ala Phe Ser Xaa Trp Ala Xaa Ser Leu Leu Xaa Tyr Ala                 275                 280                 285 Thr Leu Met Asp Phe Xaa Thr Xaa Ser Leu Xaa Xaa Ile Xaa Ala                 290                 295                 300 Leu Xaa Asp Xaa Pro Xaa Xaa Xaa Xaa Glu Xaa Xaa Xaa Ala Xaa                 305                 310                 315 Pro Xaa Xaa Met Xaa Xaa Ser Gln Xaa Ser Ile Pro Xaa Val Asp                 320                 325                 330 Xaa Val Leu Leu Met Asp Ala Arg Pro Pro Met Val Xaa Ala Gly                 335                 340                 345 Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa                 350                 355                 360 Xaa Ser Thr Ser Xaa Xaa Tyr Lys Xaa Xaa Gly Gln Phe Xaa Gln                 365                 370                 375 Arg Xaa His Xaa Ser Val Ala Asp Gly His Xaa Pro Ile Xaa Xaa                 380                 385                 390 Asp Leu Phe Asp Xaa Gly Xaa Xaa Xaa Xaa Pro Val Gly Xaa Gln                 395                 400                 405 Xaa Val Trp Asp Xaa Xaa Xaa Xaa Gly Lys Xaa Xaa Xaa Tyr Xaa                 410                 415                 420 Cys Trp Arg Xaa Xaa Xaa Xaa Gln Gly Tyr Xaa Xaa Xaa Gly Asp                 425                 430                 435 Val Xaa Met Leu Ala Xaa Ser Gly Tyr Asn Pro Pro Asn Leu Pro                 440                 445                 450 Xaa Xaa Xaa Cys Xaa His Xaa Ser Leu Xaa Ala Xaa Xaa Xaa Thr                 455                 460                 465 Leu Xaa Xaa Xaa Xaa Trp Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa                 470                 475                 480 Xaa Val Ser Leu Trp Xaa Pro Gly Ala Ala Gly Ala Val Ala Ser                 485                 490                 495 Ser Cys Phe Ala Gly Val Pro Asn Tyr Asn Asn Pro Pro Asn Ser                 500                 505                 510 Gly Xaa Ile Xaa Xaa Leu Xaa Gly Ser Ile Ala Cys Val Xaa Thr                 515                 520                 525 Ser Ala Ile Ala Ser Met Xaa Xaa Met Xaa Ser Met Leu Ser Xaa                 530                 535 His Xaa Gly Met Glu Ala Met Met Ser Lys Leu,

wherein Xaa at position 2 is Ala or Arg; Xaa at position 9 is Gln, Lys or Glu; Xaa at position 14 is Pro or Ala; Xaa at position 16 is Val or Asp; Xaa at position 19 is Met or Leu; Xaa at position 22 is Gly or Ser; Xaa at position 24 is Asp, Asn or Gln; Xaa at position 36 is Leu or Met; Xaa at position 42 is Asp, Asn or Gln; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu, Asp, Asn or Gly; Xaa at position 50 is Ile or Val; Xaa at position 51 is Glu or Gln; Xaa at position 55 is Arg or Lys; Xaa at position 56 is Ser or Thr; Xaa at position 57 is Tyr or Phe; Xaa at position 58 is Thr or Ser; Xaa at position 61 is Arg, Lys or Glu; Xaa at position 73 is Phe or Tyr; Xaa at position 74 is Lys, Glu, Gly, Arg, Met, Leu, His or Asp; Xaa at position 76 is Asp or Gln; Xaa at position 79 is Lys or Glu; Xaa at position 80 is Glu or Ser; Xaa at position 82 is Glu, Ile, Leu, Tyr or Gln; Xaa at position 83 is Glu or Gln; Xaa at position 84 is Tyr or Phe; Xaa at position 86 is Glu or Gln; Xaa at position 87 is Lys or Gln; Xaa at position 88 is Met, Ile or Leu; Xaa at position 90 is Gln or Glu; Xaa at position 94 is Val or Ile; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln, Gly or Ser; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile, Met, Phe, Cys, Val or Asn; Xaa at position 103 is Ala or Gly; Xaa at position 105 is Leu or Ile; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp, Asn, Glu, Cys, Ala or Arg; Xaa at position 112 is Thr or Ser; Xaa at position 113 is Asp, Glu or Met; Xaa at position 117 is Thr or Ser; Xaa at position 121 is Tyr or Phe; Xaa at position 127 is Ala or Thr; Xaa at position 142 is Arg or Glu; Xaa at position 146 is Arg or Gln; Xaa at position 147 is Arg, Glu or Gln; Xaa at position 148 is Asp, Phe, Pro, Val, Glu, His, Trp, Ala, Arg, Leu, Ser, Gln or Gly; Xaa at position 149 is Phe or Val; Xaa at position 150 is Arg, Gln or Glu; Xaa at position 151 is Asp, Ser, Ala, Asn, Trp, Val, Gln, Cys, Met, Leu, Arg or Glu; Xaa at position 153 is Leu or Ile; Xaa at position 154 is Asn or Asp; Xaa at position 155 is Asn or Lys; Xaa at position 159 is Pro or Asp; Xaa at position 162 is Glu, Asp or Gln; Xaa at position 165 is Lys, Glu, Gln, Pro, Thr, Ala, Leu, Gly, Asp, Val, His, Ile, Met, Trp, Phe, Tyr or Arg; Xaa at position 166 is Arg or Gln; Xaa at position 167 is Tyr, Trp or Cys; Xaa at position 170 is Tyr or His; Xaa at position 171 is Tyr or Phe; Xaa at position 172 is Ile, Leu or Val; Xaa at position 173 is Ser or Ala; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 183 is Tyr or Val; Xaa at position 184 is Ser or Thr; Xaa at position 185 is Ala or Ser; Xaa at position 189 is Thr, Lys or Ile; Xaa at position 191 is Lys or Gln; Xaa at position 193 is Asp or Asn; Xaa at position 196 is Gln, Lys, Asn, Asp, Glu, Ala, Ile or Arg; Xaa at position 202 is Ala or Val; Xaa at position 203 is Glu, Thr or His; Xaa at position 204 is Met or Ala; Xaa at position 206 is Tyr or Phe; Xaa at position 207 is Lys or Gln; Xaa at position 209 is Leu or Pro; Xaa at position 210 is Val or Ile; Xaa at position 214 is Lys, Ser or Gln; Xaa at position 216 is Glu, Gln, Phe, Val, Tyr or Arg; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser, Gln, Arg, Trp, Met, Ala, Phe, Ile, Leu, Cys or Asn; Xaa at position 229 is Arg or Glu; Xaa at position 230 is Ser or Glu; Xaa at position 231 is Asn or Ser; Xaa at position 236 is Leu or Pro; Xaa at position 245 is Met or Leu; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln, Lys or Glu; Xaa at position 257 is Gln, Ile, Glu, Cys, Ser, His, Trp or Met; Xaa at position 261 is Gln, Glu or Lys; Xaa at position 264 is Glu or Gln; Xaa at position 268 is Asp or Asn; Xaa at position 276 is Ser or Ala; Xaa at position 278 is Glu, Asn or Gln; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 282 is Pro or Gly; Xaa at position 284 is Trp or Arg; Xaa at position 287 is Ala or Cys; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr, Asp, Phe, Ser, Met, Arg, Trp, Ile, His, Asn, Cys, Gly or Ala; Xaa at position 291 is Glu or Gln; Xaa at position 292 is Arg or Gln; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 294 is Val or Ala; Xaa at position 296 is Leu or Ile; Xaa at position 297 is Glu or Gln; Xaa at position 298 is Asp or Gln; Xaa at position 300 is Phe or Tyr; Xaa at position 302 is Glu or Gln; Xaa at position 303 is Phe or Tyr; Xaa at position 305 is Lys or Gln; Xaa at position 306 is Gln or Lys; Xaa at position 309 is Gln, Lys or Glu; Xaa at position 313 is Lys, Gln or Arg; Xaa at position 316 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 331 is Glu, Asn or Gln; Xaa at position 333 is Ser, Arg, Gly, Lys, Val, Asn, Ala, His, Gln, Thr, Asp, Ile, Leu, Cys or Glu; Xaa at position 334 is Gly, Arg, Lys, Ile or Trp; Xaa at position 335 is Ser or Ala; Xaa at position 336 is Gly or Ala; Xaa at position 337 is Ala, Val or Gly; Xaa at position 338 is Ser, His, Val, Lys, Ala, Gly, Thr, Ile, Glu, Met, Arg, Pro, Asp, Asn or Leu; Xaa at position 339 is Glu, Asn, Gln, Ile, Pro, Met, Ser, Ala, Cys, Phe, Val, Leu, Asp, Trp, His or Arg; Xaa at position 341 is Leu or Val; Xaa at position 342 is Ala, Ser or Val; Xaa at position 343 is Val or Ile; Xaa at position 344 is Phe or Trp; Xaa at position 345 is Asn or His; Xaa at position 346 is Pro or Ala; Xaa at position 350 is Asn or Ser; Xaa at position 351 is Gly or Val; Xaa at position 354 is Met or Leu; Xaa at position 355 is Val, Ile or Leu; Xaa at position 359 is Gly or Ala; Xaa at position 362 is Asn or Ser; Xaa at position 364 is Ala or Ser; Xaa at position 371 is Ala, Gly or Thr; Xaa at position 374 is Phe or Ile; Xaa at position 375 is Lys or Arg; Xaa at position 380 is Leu or Gly; Xaa at position 382 is Val, Asp or Leu; Xaa at position 383 is Leu, Ile or Val; Xaa at position 384 is Lys, Ala or Gly; Xaa at position 385 is Ala or Gly; Xaa at position 389 is Trp or Tyr; Xaa at position 391 is Arg, Leu, Glu, Gln or Asp; Xaa at position 395 is Asp or Cys; Xaa at position 396 is Ala, Leu, Lys, Asn, Gly, Ile, Met, Arg, Tyr, Gln or His; Xaa at position 397 is Gly, Arg or Ala; Xaa at position 398 is Ser, Gln or Cys; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val, Arg, Ile, Asn, Phe, Thr, Ala, Asp, Met, Gln or Glu; Xaa at position 402 is Lys, Phe, His, Arg, Trp, Gly, Asn, Leu, Tyr, Thr, Val, Met, Pro or Ala; Xaa at position 403 is Asp, Tyr, Trp, Phe or Glu; Xaa at position 405 is Ala or Ser; Xaa at position 409 is Ala or Pro; Xaa at position 410 is Ile or Val; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg, Glu or Gln; Xaa at position 417 is Ala, Ser or Cys; Xaa at position 418 is Leu or Met; Xaa at position 422 is Met or Val; Xaa at position 426 is Thr or Ser; Xaa at position 436 is Asp or Lys; Xaa at position 437 is Tyr or Val; Xaa at position 438 is Val or Arg; Xaa at position 440 is Val or Leu; Xaa at position 442 is Gln, Lys or Glu; Xaa at position 445 is Cys, Leu or Thr; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu, Ile, Gly, Pro, Leu, Phe, Trp or Thr; Xaa at position 448 is Val or Ala; Xaa at position 449 is Gln or Glu; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 453 is Asn or Asp; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp, Gln, Gly, His, Asp, Leu, Thr, Pro or Asn; Xaa at position 455 is Val or Ile; Xaa at position 457 is Trp or Asn; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile, Thr, Ser, His, Cys, Tyr, Pro, Asn, Ala, Arg or Glu; Xaa at position 460 is Gly or Ala; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala, Ser or Gly; Xaa at position 464 is Arg, Gly, His, Gln, Thr or Phe; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly, Arg, Thr, His, Cys, Trp, Phe or Leu; Xaa at position 466 is Asp or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 497 is Asp or Gln; Xaa at position 499 is Glu or Gln; Xaa at position 500 is Arg, Gln or Lys; Xaa at position 502 is Arg, Glu or Gln; Xaa at position 509 is Lys, Gln or Glu; Xaa at position 517 is Gln, Cys, Asn, Val or Pro; Xaa at position 518 is Glu or Gln; Xaa at position 520 is Lys, Gln or Glu; Xaa at position 525 is Gln or Lys; and Xaa at position 527 is Gln, Lys, Pro, Cys, Glu, Ser, His, Phe or Trp; wherein one or more amino acid(s) designated by Xaa in SEQ ID NO: 3 is an amino acid different from the corresponding amino acid of SEQ ID NO: 35 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO:
 35. 18. The polynucleotide encoding a PHI-4 polypeptide of claim 16, wherein the PHI-4 polypeptide comprises an amino acid sequence of the formula, (SEQ ID NO: 4)                 5                   10                  15 Met Xaa Ser Ala Ala Asn Ala Gly Gln Leu Gly Asn Leu Pro Gly                 20                  25                  30 Val Thr Ser Met Gly Met Gly Tyr Xaa Val Asn Gly Leu Tyr Ala                 35                  40                  45 Ser Pro Glu Ser Leu Leu Gly Gln Pro Leu Phe Xaa Xaa Gly Gly                 50                  55                  60 Xaa Leu Asp Ser Ile Glu Ile Glu Gly Arg Ser Tyr Thr Phe Pro                 65                  70                  75 Arg Ser Met His Val His Thr Tyr Phe His Ser Asp Phe Xaa Gln                 80                  85                  90 Asp Val Ser Xaa Glu Ile Xaa Glu Tyr Arg Glu Lys Met Ser Gln                 95                  100                 105 His Val Gly Val Ser Gly Xaa Xaa Xaa Leu Phe Ser Ala Ser Leu                 110                 115                 120 Ser Val Asp Xaa Thr Thr Thr Asp Gln Gln Leu Thr Glu Ile Thr                 125                 130                 135 Tyr Ser Ser Thr Arg Glu Ala His Val Leu Trp Tyr Ile Ser Leu                 140                 145                 150 Pro Gly Ala Ala Thr Leu Arg Ser Met Leu Arg Xaa Xaa Phe Xaa                 155                 160                 165 Xaa Asp Xaa Asn Asn Pro Asn Met Pro Ala Met Xaa Leu Phe Xaa                 170                 175                 180 Xaa Tyr Gly Pro Tyr Xaa Ile Ser Xaa Ala Ala Val Gly Gly Arg                 185                 190                 195 Leu Xaa Tyr Ser Ala Ala Ser Lys Thr Leu Lys Met Asp Ser Ser                 200                 205                 210 Xaa Ser Leu Ser Thr Thr Ala Xaa Met Ser Xaa Lys Ala Leu Val                 215                 220                 225 Gly Glu Ile Lys Ile Xaa His Gly Ser Xaa Met Glu Lys Gln Val                 230                 235                 240 Asn Ser Phe Arg Ser Asn Ser Thr Ile Arg Leu Thr Ala Thr Gly                 245                 250                 255 Gly Lys Pro Gly Met Thr Xaa Arg Ile Leu His Gly Pro Asp Ser                 260                 265                 270 Xaa Xaa Ala Phe Ser Xaa Trp Ala Glu Ser Leu Leu Asp Tyr Ala                 275                 280                 285 Thr Leu Met Asp Phe Ser Thr Xaa Ser Leu Xaa Pro Ile Trp Ala                 290                 295                 300 Leu Ala Asp Xaa Pro Glu Arg Xaa Val Glu Leu Glu Asp Ala Phe                 305                 310                 315 Pro Glu Phe Met Lys Gln Ser Gln Gln Ser Ile Pro Xaa Val Asp                 320                 325                 330 Lys Val Leu Leu Met Asp Ala Arg Pro Pro Met Val Xaa Ala Gly                 335                 340                 345 Glu Asp Xaa Xaa Ser Xaa Ala Xaa Xaa Asp Leu Ala Xaa Phe Asn                 350                 355                 360 Xaa Ser Thr Ser Asn Gly Tyr Lys Met Xaa Gly Gln Phe Xaa Gln                 365                 370                 375 Arg Asn His Ala Ser Val Ala Asp Gly His Ala Pro Ile Phe Lys                 380                 385                 390 Asp Leu Phe Asp Leu Gly Val Leu Lys Ala Pro Val Gly Trp Gln                 395                 400                 405 Xaa Val Trp Asp Asp Xaa Gly Ser Gly Lys Xaa Xaa Xaa Tyr Ala                 410                 415                 420 Cys Trp Arg Ala Ile Xaa Xaa Gln Gly Tyr Xaa Xaa Xaa Gly Asp                 425                 430                 435 Val Met Met Leu Ala Xaa Ser Gly Tyr Asn Pro Pro Asn Leu Pro                 440                 445                 450 Asp Tyr Val Cys Xaa His Gln Ser Leu Cys Ala Xaa Val Gln Thr                 455                 460                 465 Leu Xaa Asn Xaa Xaa Trp Trp Asp Xaa Gly Xaa Xaa Xaa Xaa Xaa                 470                 475                 480 Asp Val Ser Leu Trp Xaa Pro Gly Ala Ala Gly Ala Val Ala Ser                 485                 490                 495 Ser Cys Phe Ala Gly Val Pro Asn Tyr Asn Asn Pro Pro Asn Ser                 500                 505                 510 Gly Asp Ile Glu Xaa Leu Arg Gly Ser Ile Ala Cys Val Xaa Thr                 515                 520                 525 Ser Ala Ile Ala Ser Met Gln Glu Met Xaa Ser Met Leu Ser Gln                 530                 535 His Xaa Gly Met Glu Ala Met Met Ser Lys Leu,

wherein Xaa at position 2 is Ala or Arg; Xaa at position 24 is Asp or Asn; Xaa at position 42 is Asp or Asn; Xaa at position 43 is Phe or Glu; Xaa at position 46 is Glu or Asn; Xaa at position 74 is Lys, Glu or Gly; Xaa at position 79 is Lys or Glu; Xaa at position 82 is Glu, Ile, Leu or Tyr; Xaa at position 97 is Arg, Asn, Asp, Glu, Gln or Gly; Xaa at position 98 is Tyr or Phe; Xaa at position 99 is Lys, Leu, Tyr, Ile or Met; Xaa at position 109 is Phe, Lys, Gly, Met, Ser, Asp or Asn; Xaa at position 147 is Arg or Glu; Xaa at position 148 is Asp, Phe or Pro; Xaa at position 150 is Arg or Gln; Xaa at position 151 is Asp, Ser, Ala or Asn; Xaa at position 153 is Leu or Ile; Xaa at position 162 is Glu or Gln; Xaa at position 165 is Lys, Glu or Gln; Xaa at position 166 is Arg or Gln; Xaa at position 171 is Tyr or Phe; Xaa at position 174 is Glu or Gln; Xaa at position 182 is Asp or Gln; Xaa at position 196 is Gln, Lys, Asn or Asp; Xaa at position 203 is Glu, Thr or His; Xaa at position 206 is Tyr or Phe; Xaa at position 216 is Glu or Gln; Xaa at position 220 is Glu, His, Asp, Thr, Tyr, Val, Ser or Gln; Xaa at position 247 is Asp or Tyr; Xaa at position 256 is Gln or Lys; Xaa at position 257 is Gln or Ile; Xaa at position 261 is Gln or Glu; Xaa at position 278 is Glu or Asn; Xaa at position 281 is Gln, Lys or Glu; Xaa at position 289 is Lys, Leu, Val, Pro, Glu, Gln, Tyr, Thr or Asp; Xaa at position 293 is Arg, Glu or Gln; Xaa at position 313 is Lys or Gln; Xaa at position 328 is Lys, Glu or Gln; Xaa at position 333 is Ser, Gly, Lys, Val or Asn; Xaa at position 334 is Gly, Arg, Lys or Ile; Xaa at position 336 is Gly or Ala; Xaa at position 338 is Ser, His, Val, Lys or Ala; Xaa at position 339 is Glu, Asn, Ile or Pro; Xaa at position 343 is Val or Ile; Xaa at position 346 is Pro or Ala; Xaa at position 355 is Val or Ile; Xaa at position 359 is Gly or Ala; Xaa at position 391 is Arg, Glu or Gln; Xaa at position 396 is Ala, Leu, Lys, Asn or Gly; Xaa at position 401 is Ser, His, Pro, Gly, Lys, Val or Arg; Xaa at position 402 is Lys, Phe, His, Arg, Gly, Trp, Thr, Asn, Tyr or Met; Xaa at position 403 is Asp or Tyr; Xaa at position 411 is Pro or Ala; Xaa at position 412 is Pro or Ala; Xaa at position 416 is Arg or Glu; Xaa at position 417 is Ala or Ser; Xaa at position 418 is Leu or Met; Xaa at position 426 is Thr or Ser; Xaa at position 440 is Val or Leu; Xaa at position 447 is Asp, Lys, Tyr, Ser, Glu or Ile; Xaa at position 452 is Gln, Lys or Glu; Xaa at position 454 is Arg, Tyr, Met, Ser, Val, Ile, Lys, Phe, Trp or Gln; Xaa at position 455 is Val or Ile; Xaa at position 459 is Lys, Met, Val, Trp, Gln, Ile or Tyr; Xaa at position 461 is Thr or Ser; Xaa at position 462 is Gly or Ala; Xaa at position 463 is Ala or Ser; Xaa at position 464 is Arg, Gly or His; Xaa at position 465 is Lys, Asn, Val, Met, Pro, Gly or Arg; Xaa at position 471 is Gln, Lys or Glu; Xaa at position 500 is Arg or Gln; Xaa at position 509 is Lys or Gln; Xaa at position 520 is Lys, Gln or Glu; and Xaa at position 527 is Gln, Lys, Pro, Cys or Glu; wherein one or more amino acid(s) designated by Xaa in SEQ ID NO: 4 is an amino acid different from the corresponding amino acid of SEQ ID NO: 35 and wherein the PHI-4 polypeptide has increased insecticidal activity compared to SEQ ID NO:
 35. 19. The polynucleotide of claim 17 or 18, wherein the PHI-4 polypeptide further comprises one or more amino acid substitutions at position 86, 359, 399, 464, 465, 466, 467, 468, 499 or 517 of SEQ ID NO: 3 or SEQ ID NO:
 4. 20. The polynucleotide of claim 19, wherein in the PHI-4 polypeptide the amino acid at position 86 is Glu or Thr; the amino acid at position 359 is Gly or Ala; the amino acid at position 399 is Gly or Ala; the amino acid at position 464 is Arg, Ala, Lys, Asp or Asn; the amino acid at position 465 is Lys or Met, the amino acid at position 467 is Val, Ala, Leu or Thr; the amino acid at position 468 is Ser or Leu; the amino acid at position 499 is Glu or Ala, or the amino acid at position 517 is Glu or Arg.
 21. The polynucleotide of claim 17 or 18, wherein the PHI-4 polypeptide further comprises one or more amino acid substitution, insertion of one or more amino acids, deletion of one or more amino acids, and combinations thereof.
 22. The polynucleotide of claim 17 or 18, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 2 fold or greater compared to AXMI-205 (SEQ ID NO: 35).
 23. The polynucleotide of claim 17 or 18, wherein the insecticidal activity of the PHI-4 polypeptide is increased about 5 fold or greater compared to AXMI-205 (SEQ ID NO: 35).
 24. A composition, comprising an insecticidally-effective amount of the PHI-4 polypeptide of claim
 1. 25. A method of inhibiting growth or killing an insect pest, comprising contacting the insect pest with the composition of claim
 24. 26. A method for controlling an insect pest population resistant to a pesticidal protein, comprising contacting the resistant insect pest population with the composition of claim
 1. 27. The method of controlling an insect pest population resistant to an pesticidal protein of claim 1, wherein the pesticidal protein is selected from Cry1Ac, Cry1Ab, Cry1A.105, Cry1Ac, Cry1F, Cry1Fa2, Cry1F, Cry2Ab, Cry3A, mCry3A, Cry3Bb1, Cry34Ab1, Cry35Ab1, Vip3A, Cry9c, eCry3.1Ab and CBI-Bt.
 28. A transgenic plant or progeny thereof, comprising the polynucleotide of claim
 16. 29. The transgenic plant or progeny thereof of claim 28, wherein the transgenic plant is a monocotyledon.
 30. The transgenic plant or progeny thereof of claim 29, further comprising one or more additional transgenic traits.
 31. Seed, grain or processed product thereof of the transgenic plant of claim 28, wherein the seed, grain, or processed product thereof comprises the polynucleotide of any one of claim
 16. 32. An expression cassette, comprising the polynucleotide of claim 16 operably linked to one or more regulatory sequences directing expression of the PHI-4 polypeptide.
 33. A transgenic plant or plant cell, comprising the expression cassette of claim
 32. 34. A method for protecting a plant from an insect pest, comprising expressing in the plant or cell thereof, an insecticidally-effective amount of the PHI-4 polypeptide of claim
 1. 35. A method for controlling an insect pest population, comprising contacting the insect pest population with an insecticidally-effective amount of the PHI-4 polypeptide of claim
 1. 36. A method of inhibiting growth or killing an insect pest, comprising contacting the insect pest with a composition comprising an insecticidally-effective amount of the the PHI-4 polypeptide of claim
 1. 37. A method for controlling an insect pest population resistant to a pesticidal protein, comprising contacting the insect pest population with an insecticidally-effective amount of the PHI-4 polypeptide of claim
 1. 38. A fusion protein comprising the PHI-4 polypeptide of claim
 1. 